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8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, AND 67084 ALT, 
HUMAN, PROTEINS AND METHODS OF USE THEREOF 

» I 

5 Related Applications 

This application claims the benefit of prior-filed provisional patent application Serial 
No. 60/256,240, filed December 15, 2000, entitled "8099- AND 46455, NOVEL HUMAN 
SUGAR TRANSPORTERS AjND USES THEREFOR," prior-filed provisional patent , 
application Serial No. 60/256,588, filed December 1 8, 2000, entitled "54414 AND 53763, 
10 NOVEL HVMAN POTASSIUM CHANNELS AND USES THEREFOR," and prior-filed 
provisional patent application Serial No. 60/258,028, filed December 21, 2000, entitled 
"67076, 67102, 44181, 67084FL, and 67084alt, NOVEL HUMAN PHOSPHOLIPID 
TRANSPORTERS AND USES THEREOF." The entire contents of the above-referenced 
applications are incorporated herein by this reference. 1 

^ 15 

Background of the Invention > ■ , 

Cellular membranes serve to differentiate the contents of a cell from the surrounding 
environment, and may also serve as effective barriers against the unregulated influx of 
hazardous or unwanted compounds, and the unregulated efflux of desirable compounds. 

20 Membranes are by nature impervious to the unfacilitated diffusion of hydrophilic 

compounds such as proteins, water molecules, and ions due to their structure: a bilayer of 
lipid molecules in which the polar head groups face outward (towards the exterior and 
interior of the cell) and the nonpolar tails face inward (at the center of bilayer, forming a 
hydrophobic core). Membranes enable a cell to maintain a relatively higher intracellular 

25 concentration of desired compounds and a relatively lower intracellular concentration of 
undesired compounds than are contained within the surrounding environment. 

Membranes also present a structural difficulty for cells, in that most desired 
compounds cannot readily enter the cell, nor can most waste products readily exit the cell 
through this lipid bilayer. The import and export of such compounds is regulated by 

30 proteins which are embedded (singly or in complexes) in the cellular membrane. Two 

mechanisms exists whereby membrane proteins allow the passage of compounds: non- 

- ■ i 

mediated and mediated transport. Simple diffusion is an example of non-mediated transport, 
while facilitated diffusion and active transport are examples of mediated transport. 
Permeases, porters, translocases, translocators, and transporters are proteins that engage in 
35 mediated transport (Voet and Voet (1990) Biochemistry, John Wiley and Sons, Inc., New 
York, N.Y. pp. 484-505). 

Sugar transporters are members of the major facilitator superfamily of transporters. 
These transporters are passive in the sense that they are driven by the substrate concentration 
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gradient arid they exhibit distinct kinetics as well as sugar substrate specificity. Members of 
this faniily share several characteristics: (1) they contain twelve transmembrane domains 
separated by hydrophilic loopi; (2) they have intracellular "N- and C-termfni; and (3) they are 
thought to function as oscillating pores. The transport mechanism occurs via sugar binding 
5 to the exofacial binding site of the transporter, which is thought to trigger a conformational 
change causing the sugar binding site to re-orient to the endofacial conformation, allowing 
the release of substrate. These transporters are speqific for various sugars and are found in 
both prokaryotes and eukaryotfes. In mammals, sugar transporters transport various 
monosaccharides 'across the cell membrane (Walmsley et at (1 998) Trends in Biochem. Sci. 

10 23:476-4811 Barrett efal (1999) > Cwrr. Op. Cell Biol. 11:496-502). 

At least nine mammalian glucose transporters have been identified, GLUT1 - 
GLUT9, which are expressed in a tissue-specific manner (e.g., in brain, erythrocyte, kidney, 
muscle, and adipose tissues) (Shepherd et al. (199$) N. Engl J. Afed. 341 :248-257; Doege et 
al (2000) Biochem. J. 350:771-776). Some GLUT proteins have been shown to be present 

15 in low amounts at the plasma, mfefnbrane during the basal state, at which time large amounts 
are sequestered, in intracellular vesicle stores. Stimulatory molecules specific for each 
GLUT (such as insulin) regulate the translocation of the GLUT-containing vesicles to the 
plasma membrane. The vesicles fuse at the membrane and subsequently expose the GLUT 
protein to the extracellular milieu to allow glucose (and other monosaccharide) transport 

20 into the cell (Walmsley et al (1 998) Trends in Biochem. Sci. 23 :476-48 1 ; Barrett et al 
(1999) Curr. Op. Cell Biol 1 1 :496-502). Other GLUT transporters play a role in 
constitutive sugar transport. 

Potassium (K + ) channels are ubiquitous proteins which are involved in the setting of 
the resting membrane potential as well as in the modulation of the electrical activity of cells. 

25 In excitable cells, K + channels influence action potential waveforms, firing frequency, and 
neurotransmitter secretion (Rudy, B. (1988) Neuroscience, 25, 729-749; Hille, B. (1992) 
Ionic Channels of Excitable Membranes, 2nd Ed.). In non-excitable cells, they are involved 
in hormone secretion, cell volume regulation and potentially in cell proliferation and 
differentiation (Lewis et al (1995) Anmi. Rev. Immunol, 13, 623-653). Developments in 

30 electrophysiology have allowed the identification and the characterization of an astonishing 
variety of K + channels that differ in their biophysical properties, pharmacology, regulation 
and tissue distribution (Rudy, B. (1988) Neuroscience, 25, 729-749; Hille, B. (1992) Ionic 
Channels of Excitable Membranes, 2nd Ed.). More recently, cloning efforts have shed 
considerable light on the mechanisms that determine this functional diversity. Furthermore, 

35 analyses of structure-function relationships have provided an important set of data 

concerning the molecular basis of the biophysical properties (selectivity, gating, assembly) 
and the pharmacological properties of cloned K + channels. 
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Functional diversity of K + channels arises mainly from the existence of a great 
number of genes coding for pore-forming subunits, as well as for other associated regulatory 
subunits. Two main structural families of pore-forming subunits have been identified. The 
first ohe consists of subunits with a conserved hydrophobic core containing six 
5 transmembrane domains (TMDs). These K + channel a subunits participate in the formation 
of outward rectifier voltage-gated (Kv) and Ca^dependent K + channels. The fourth TMD 
contains repeated positive charges involved in- the voltage gating of these channels and 
hence in their outward rectification (Logothetis et ah (1992)' Neuron, 8, 531-540; Bezanilla 
et ah (1994) Biophys. J. 66, 101 1-1021). 
10 The second family of pore-forming subunits have only two TMDs. They are 

• essential subunits of inward-rectifying (IRK), G-protein-coupled (GIRK) arid ATP-sensitiVe 
(Katp) K"*~ channels. The inward rectification results from a voltage-dependent block by 

cytoplasmic Mg 2+ and polyamines (Matsuda, H. (1991) Anna. Rev. Physiol., 53, 289-298). 
A conserved domain, called the P domain, is present in all members of both families (Pongs, 

15 O. (1993) J. Membr. Bioh, 136, 1-8; Heginbotham et ah (1994) Biophys. J. 66,1061-1067; 
Mackinnon, R. (1995) Neuron, 14, 889-892; Pascual etah, (1995) Neuron., and 14, 1055- 
1063). This domain is an essential element of the aqueous, K + -selective pore. In both 
groups, the assembly of four subunits is necessary to form a functional K + channel 
(Mackinnon, R. (1991) Nature, 350, 232-235; Yang et ah, (1995) Neuron, 15, 1441-1447. 

20 In both six TMD and two TMD pore-forming subunit families, different subunits 

coded by different genes can associate to form heterotetramers with new channel properties 
(Isacoff et ah, (1 990) Nature, 345, 530-534). A selective formation of heteropolymeric 
channels may allow each cell to develop the best K + current repertoire suited to its function. 
Pore-forming a subunits of Kv channels are classified into different subfamilies according 

25 to their sequence similarity (Chandy et ah (1993) Trends Pharmacol. Sch, 14: 434). 
Tetramerization is believed to occur preferentially between members of each subgroup 
(Covarrubias et ah (1991) Neuron, 7, 763-773). The domain responsible for this selective 
association is localized in the N-terminal region and is conserved between members of the 
same subgroup. This domain is necessary for hetero- but not homo-muitimeric assembly 

30 within a subfamily and prevents co-assembly between subfamilies. Recently, pore-forming 
subunits with two TMDs were also shown to co-assemble to form heteropolymers (Duprat et 
ah (1995) Biochem. Biophys. Res. Commun., 212, 657-663. This heteropolymerization 
seems necessary to give functional GIRKs. IRKs are active as homopolymers but also form 
heteropolymers. 

35 New structural types of K + channels were identified recently in both humans and 

yeast. These channels have two P domains in their functional subunit instead of only one 
(Ketchum et ah (1995) Nature, 376, 690-695; Lesage et ah (1996) J. Bioh Chem., 271, 
4183-4187; Lesage et ah (1996) EMBOJ., 15, 1004-101 1; Reid et ah (1996) Receptors 
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Channels 4, 51-62). The human channel called TWIK-1 , has four TMDs. TWIK-1 is 
expressed widely in human tissues and is particularly abundant in the heart and the brain. . 
TWIK-1 currents are time independent and inwardly rectifying. These properties suggest 
that TWIK-1 channels are involved in the control of the background K+ membrane 
conductance (Lesage er al. (1996) EMBO J., 15, 1004-101 1). 

Potassium channels are potassium ion selective, and can determine membrane 
excitability (the ability of v for example, a neuron to.respo'nd to a stimulus and convert it into 
an impulse). Potassium chanriels can also influence the resting potential of membranes; 
wave forms and frequencies of action potentials, and thresholds of excitation. Potassium 
channels are typically expressed jn electrically excitable cells, e g: , neurons, muscle, 
endocrine, and egg cells, and may form heteromultimeric structures, e.g., composed of pore- 
forming and cytdplasmic subunits. Potassium channels may also be found in non-excitable 
cells, where they may play a role in, e.g. , signal transduction. Examples of potassium 
channels include: (1) the voltage-gated potassium channels, (2) the ligand-gated potassium 
channels, e.g., neurotransmitt;er'gated potassium channels, and (3) cyclic-nucleotide-gated 
potassium channels. Voltage-gated and ligand-gated potassium channels are expressed in 
the brain, e.g., in brainstem monoaminergic and forebrain cholinergic neurons, where they 
are involved in the release of neurotransmitters, or in the dendrites of hippocampal and 
neocortical pyramidal cells, where they are involved in the processes of learning and 
memory formation. For a detailed description of potassium channels, see Kandel E. R et 
al, Principles of Neural Science, second edition, (Elsevier Science Publishing Co., Inc., 
N. Y. ( 1 985)), the contents of which are incorporated herein by reference. 

The El -E2 ATPase family is a large superfamily of transport enzymes that contains 
at least 80 members found in diverse organisms such as bacteria, archaea, and eukaryotes 
(Palmgren, M. G. and Axelsen, K. B..(1998) Biochim. Biophys. Acta. 1365:37-45). These 
enzymes are involved in ATP hydrolysis-dependent transmembrane movement of a variety 
of inorganic cations (e.g., H + , Na + , K + , Ca 2+ , Cu 2+ , Cd + , and Mg 2+ ions) across a concentration 
gradient, whereby the enzyme converts the free energy of ATP hydrolysis into 
electrochemical ion gradients. El -E2 ATPases are also known as "P-type" ATPases, 
referring to the existence of a covalent high-energy phosphoryl-enzyme intermediate in the 
chemical reaction pathway of these transporters. Until recently, the superfamily contained 
four major groups: Ca 2+ transporting ATPases; Na7K + - and gastric H7K + transporting 
ATPases; plasma membrane IT transporting ATPases of plants, fungi, and lower 
eukaryotes; and all bacterial P-type ATPases (Kuhlbrandt et al. (1998) Curr. Opin Struct. 
35 5/o/. 8:510-516). 

E1-E2 ATPases are phosphorylated at a highly conserved DKTG sequence. 
Phosphorylation at this site is thought to control the enzyme's substrate affinity. Most El- 
E2 ATPases contain ten alpha-helical transmembrane domains, although additional domains 
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may be present. A majority of known gated-pore translocators contain twelve alpha-helices, 
"'" including Na7H* antiporters (West (1997) Biochim. Biophys. Acta 1331 :2 13-234). 

Members of the El -E2 ATPase superfamily ard able to generate electrochemical ion 
gradients which enable a variety of processes in the cell such as absorption, secretion, 
5 transmembrane signaling, nerve impulse transmission; excitation/contraction coupling, and 
growth and differentiation (Scarborough (1999) Curr. Op. Cell Biol 1 1 :5 1 7-522). These 
molecules are thus critical to normal cell' function and well-being of the organism. 

Recently, a new class of E1-E2 ATPases was identified, the aminophospholipid 
transporters or translocators. These transporters transport not cations, but phospholipids 
10 (Tang, X. et al. (1996) Scienci 272:1495-1497; Bull, L. N. et al,(1998) Ato. Genet 18:219- 
• 224; Mauro, I. et al. (1999) Biocf?em. Biophys. Res. Commun. 257:333-339). These * 
transporters are involved in cellular functions including bile acid secretion and maintenance 
of the asymmetrical integrity of the plasma membrane. 

Given the important biological and physiological roles played by the sugar 
15 transporter family of proteins, the potassium channel family of proteins, and the E 1 -E2 - 
ATPase family of proteins, there exists a need to identify novel potassium channel family 
members for use in a variety of diagnostic/prognostic, as w;ell as therapeutic applications 

Summary of the Invention 

20 The present invention is based, at least in part, on the discovery of novel human 

sugar transporter family members, referred to herein as "8099 and 46455" nucleic acid and 
polypeptide molecules. The 8099 and 46455 nucleic acid and polypeptide molecules of the 
present invention are useful as modulating agents in regulating a variety of cellular 
processes, e.g. , sugar homeostasis. Accordingly, in one aspect, this invention provides 
25 isolated nucleic acid molecules encoding 8099 and 46455 polypeptides or biologically 

active portions thereof, as well as nucleic acid fragments suitable as primers or hybridization 
probes for the detection of 8099 and 4645 5 -encoding nucleic acids. 

The present invention is also based, at least in part, on the discovery of novel 
potassium channel family members, referred to herein as "54414 and 53763" nucleic acid 
30 and polypeptide molecules. The 54414 and 53763 nucleic acid and protein molecules of the 
present invention are useful as modulating agents in regulating a variety of cellular 
processes, e.g. , gene expression, intra- or intercellular signaling ? and/or membrane 
excitability or conductance. Accordingly, in one aspect, this invention provides isolated 
nucleic acid molecules encoding 54414 and 53763 proteins or biologically active portions 
35 thereof, as well as nucleic acid fragments suitable as primers or hybridization probes for the 
detection of 54414 and 53763-encoding nucleic acids. 

The present invention is also based, at least in part, on the discovery of novel human 
phospholipid transporter family members, referred to herein as "67076, 67102, 44181, 

5 
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67084FL, or 67084alt" nucleic acid and polypeptide molecules. The 67076, 67102, 44181, 
67084FL, or 67084alt nucleic acid and polypeptide molecules of the present invention are 
useful as modulating agents in regulating a variety of cellular processes, e.g., phospholipid 
transport {e.g., aminophospholipid transport), absorption, secretion, gene expression, intra- 
5 or inter-cellular signaling, and/or cellular proliferation, growth, apoptosis, and/or 

differentiation. Accordingly, in one aspect, this invention provides isolated nucleic acid 
molecules encoding 67076, 67102, 441 81, 67084FL, or 67084alt polypeptides or . 
biologically active portions thereof, as well as nucleic acid fragments suitable as primers or 
hybridization probes for the detection of 67076, 67102, 44181, 67084FL, or 67084alt- 

10 encoding nucleic acids. 1 

• , In.one embodiment, the invention features an isolated nucleic acid molecule that ' 
includes the nucleotide sequence set forth in SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4. 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
NO: 1 3 , SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 8, SEQ ID NO: 19, SEQ ID NO:2 1 , 

15 SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:27. In another 
embodiment, the invention features an isolated nucleic acid molecule that encodes a 
polypeptide including the amino acid sequence set forth in, SEQ ID NO:2, SEQ ID NO:5, 
SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID 
NO:23, or SEQ ID NO:26. In another embodiment, the invention features an isolated 

20 nucleic acid molecule that includes the nucleotide sequence contained in the plasmid 
deposited with ATGC® as Accession Number nr 

— _» 5 n Ui 



In still other embodiments, the invention features isolated nucleic acid molecules 
including nucleotide sequences that are substantially identical (e.g. , 60% identical) to the 

25 nucleotide sequence set forth as SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID 

NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ 
• ID NO: 1 5, SEQ ID NO; 1 6, SEQ ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:21 , SEQ ID ' 
NO:22, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:27. The invention further features 
isolated nucleic acid molecules including at least 50 contiguous nucleotides of the 

30 nucleotide sequence set forth as SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID 

NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ 
ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 8, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:27. In another embodiment the 
invention features isolated nucleic acid molecules which encode a polypeptide including an 

35 amino acid sequence that is substantially identical (e.g., 60% identical) to the amino acid 
sequence set forth as SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 1 1 , SEQ ID 
NO:14, SEQ ID NO:17, SEQ IDNO:20, SEQ IDNO:23, or SEQ ID NO:26. The present 
invention also features nucleic acid molecules which encode allelic variants of the 
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polypeptide having the amino acid sequence set forth as SEQ ID Nd:2, SEQ ID NO:5, SEQ 
ID NO:8, SEQ ID NOrl 1, STEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ IDNO:23, 
or SEQ ID NO:26.. In addition to isolated nucleic acid molecules encoding full-length 
polypeptides, the present invention also features nucleic acid molecules which encode 

5 fragments, for example, biologically active or antigenic fragments, of the full-length 

polypeptides of the present invention (e.g., fragments including at least 10 contiguous amino 
acid residues of the amino.acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, 
SEQ ID NO: 11, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ 
ID NO:26). In still other embodiments, the invention features nucleic acid molecules that 

10 are complementary to,antisense to, or hybridize under stringent conditions to the isolated 
nucleic acid molecules described herein. 

In another aspect, the invention provides vectors including the isolated nucleic acid 

molecules described herein (e.g., 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

■ i 

67084FL, or 67084alt-encoding nucleic acid molecules). Such vectors can optionally 
15 include nucleotide sequences encoding heterologous polypeptides. Also featured are host 
cells including such vectors (e.g., host cells including vectors suitable for prbducing 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleic acid molecules 
and polypeptides). 

In another aspect, the invention features isolated 8099, 46455, 54414, 53763, 67076, 

20 67102, 44181, 67084FL, or 67084alt polypeptides and/or biologically active or antigenic 
fragments thereof. Exemplary embodiments' feature a polypeptide including the amino acid 
sequence set forth as SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: l 1, SEQ ID 
NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, a polypeptide 
including an amino acid sequence at least 60% identical to the amino acid sequence set forth 

25 as SEQ ID NO:2, SEQ ID NO:5, SEQ ID ND:8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID 
NO: 1 7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID N©:26, a polypeptide encoded by a 
nucleic acid molecule including a nucleotide sequence at least 60% identical to the 
nucleotide sequence set forth as SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ 

30 IDNO:15, SEQ IDNO:16, SEQ IDNO:18, SEQ IDNO:19, SEQ IDNO:21, SEQ ID 

NO:22, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:27. Also featured are fragments of 
the full-length polypeptides described herein (e.g. , fragments including at least 10 
contiguous amino acid residues of the sequence set forth as SEQ ID NO:2, SEQ ID NO:5, 
SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO:20, SEQ ID 

35 NO:23, or SEQ ID NO:26) as well as allelic variants of the polypeptide having the amino 
acid sequence set forth as SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, 
SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26. 
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The 8099 and 46455 polypeptides and/or biologically active or antigenic fragments 
thereof, are useful, for' example, as reagents or targets in assays applicable'to treatment 
and/or diagnosis of 8099 and 46455 mediated or related disorders. In onCembodiment, 
8099 and/or 46455 polypeptides or fragments thereof, have an 8099 and/or 46455 activity 
In another embodiment, 8099 and/or 46455 polypeptides or fragments thereof, have at least 
one, preferably two, three, four, five, six, seven, eight, nine, ten, eleven, or twelve 
transmembrane domains and/or a sugar transporter family domain, and optionally have an 
8099 and/or 46455 activity. ' ' ' ■ 

The 54414 and 53763 polypeptides and/or biologically active or antigenic fragments 
thereof, are useful, for example, as reagents or targets in assays applicable to treatment 
and/or diagnosis of 54414 and 53763 mediated or related disorders. In one embodiment, a 
54414 AftD 53763 polypeptide or fragment thereof has a 54414 and 53763 activity. In ' 
another embodiment, a 54414 and 53763 polypeptide or fragment, thereof has at least one or 
more of the following domains or motifs: a transmembrane domain, an ion transport protein 
domain, a K + channel tetrame,risation domain, a P-loop motif, a pore domain, a potassium 
channel signature sequence motif, and/or a voltage sensor motif, and optionally, has a 54414 
or 53763 activity. . 

The 67076, 67102, 44181, 67084FL, or 67084alt polypeptides and/or biologically 
active or antigenic fragments thereof, are useful, for example, as reagents or targets in assays 
applicable to treatment and/or diagnosis of 67076, 67102, 44181, 67084FL, or 67084alt 
associated or related disorders. In one embodiment, a 67076, 67102, 441 81, 67084FL, or 
67084alt polypeptide or fragment thereof, has a 67076, 67102, 44181, 67084FL, or 67084alt 
activity. In another embodiment, a 67076, 67102, 44181, 67084FL, or 67084alt polypeptide 
or fragment thereof, includes at least one of the following domains, sites, or motifs: a 
transmembrane domain, anN-terminal large extramembrane domain, a C-terminal large 
extramembrane domain, an E1-E2 ATPases phosphorylation site, a P-type ATPase sequence 
1 motif, a P-type ATPase sequence 2 motif, a P-type ATPase sequence 3 motif, and/or one 
Or more phospholipid transporter specific amino acid resides, and optionally, has a 67076 
67102, 44181, 67084FL, or 67084alt activity. 

In a related aspect, the invention features antibodies (e.g. , antibodies which 
specifically bind to any one of the polypeptides described herein) as Well as fusion 
polypeptides including all or a fragment of a polypeptide described herein. 

The present invention further features methods for detecting 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides and/or 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleic acid molecules, such 
methods featuring, for example, a probe, primer or antibody described herein. Also featured 
are kits, e.g., kits for the detection of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptides and/or 8099, 46455, 54414, 53763, 67076, 67102, ' 
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44 1 8 1 , 67084FL, or 67084alt nucleic acid molecules. In a related aspect, the invention 
features methods for identifying compounds which' bind to and/or modulate the activity of a 
8099, 46455, 54414, 53763, 67076, 67102, 441 81> 67084FL, or. 67084alt polypeptide or 
8099, 46455, 54414, 53763, 67076, 67102, 44i81,.67084FL, or 67084alt nucleic acid 
5 molecule described herein. Further featured are methods for modulating a 67076, 67 102; 
44181, 67084FL, or 67084alt activity. 

Other features and advantages of the invention will be apparent from the following 
detailed description and claims. i 1 

10 Brief Description of the Drawings 

■ Figures 1A-1B depict the cDNA sequence and predicted amino acid sequence of ' 

human 8099. The nucleotide sequence corresponds to nucleic acids 1 to 2725 of SEQ ID 
NO:l. The amino acid sequence corresponds to amino acids 1 to 617 of SEQ ID NO:2. The 
' coding region without the 5' and 3 ' untranslated regions of the human 8099 gene is shown 

15 in SEQ ID NO: 3. 

Figure 2 depicts a structural, hydrophobicity, and antigenicity analysis of the human 

8099 polypeptide (SEQ ID NO:2). 

Figure 3A-C depicts the results of a search which was performed against the HMM 

database in PF AM. 

20 Figure 4 depicts an alignment of the human 8099 amino acid sequence (SEQ ID 

NO:2) with the amino acid sequence of the E. coli galactose-proton symporter GALP using 
the CLUSTAL W (1.74) alignment program (having GenBank Accession No. P37021, set 
forthasSEQIDNO:28). 

Figure 5 depicts an alignment of the human 8099 amino acid sequence (SEQ ID 
25 NO:2) with the amino acid sequence of the E. coli arabinose-proton symporter ARAE using 
the CLUSTAL W (1.74) alignment program (having GenBank Accession No. P09830, set 
. forth as SEQ ID NO:29). 

Figure 6 depicts an alignment of the human 8099 amino acid sequence (SEQ ID 
NO:2) with the amino acid sequence of E. coli GALP and ARAE using the CLUSTAL W 
30 (1 .74) alignment program (having GenBank Accession Nos. P37021 and P09830, 
respectively, set forth as SEQ ID NOs:28 and 29, respectively). 

Figure 7 depicts an alignment of the human 8099 amino acid sequence (SEQ ID 
NO;2) with the amino acid sequence of the H. sapiens facilitate glucose transporter 
GLUT8 using the CLUSTAL W (1 .74) alignment program (having GenBank Accession No. 
35 Y02168,setforthasSEQIDNO:30). 
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Figures 8A-B depict the cDNA sequence and predicted amino acid sequence of 
human 46455. The nucleotide sequence, corresponds to nucleic acids 1 to 2230 of SEQ ID • 
NO:4. The amino acid sequence corresponds to amino acids 1 to 528' of SEQ ID NO:5. The 
coding region without the 5' and 3' untranslated regions of .the human 46455 gene is shown 
5 in SEQ ID NO: 6. 

Figure 9 depicts a structural, hydrophobicity, and antigenicity analysis of the human 
46455 polypeptide (SEQ ID NO:5). ' ' • . • 

Figure 10A-C depicts the results of a search which Was performed against the HMM 
database in PF AM. , 
10 Figure 11 depicts an alignment of the human 46455 amino acid sequence (SEQ ID 

' NO:5) with the amino acid sequence of C. elegans Z92825 using the CLUSTAL W (1.74)' 
alignment program (having GenBank Accession No. Z92825, set forth as SEQ ID NO:3 1). 
( ' Figure 12 A-D depicts the nucleotide sequence of the human 54414 cDNA and the 

corresponding amino acid sequence. The nucleotide sequence corresponds to nucleic acids 
15 1 to 4632 of SEQ ID NO:7. The amino acid sequence corresponds to amino acids 1 to 1118 
of SEQ ID NO:8. The coding region without the 5' or 3' untranslated regions of the human 
54414 gene is shown in SEQ ID NO:9. 

Figure 13 depicts a structural, hydrophobicity, and antigenicity analysis of the 
human 54414 polypeptide (SEQ ID NO:8). The locations of the 6 transmembrane domains, 
20 as well as the pore domain (P), are indicated. 

Figure 14 depicts the results of a search in the HMM database, using the amino acid 
sequence of human 544 14. 

Figure 15A-B depicts a Clustal W (1. 74) multiple sequence alignment of the human 
54414 amino acid sequence (54414.prot; SEQ ID NO:8) and the amino acid sequence of the 
25 Rattus norvegicus Slack potassium channel subunit (AF08973Q; SEQ ID NO:32; GenBank 
Accession No. AAC83350). Amino acid identities are indicated by stars. The six 
• transmembrane domains (TM1, TM2, etc.) are boxed. The pore domain, which contains the 
potassium channel signature sequence motif, is also boxed. 

Figure 16A-C depicts the nucleotide sequence of the human 53763 cDNA and the 
30 corresponding amino acid sequence. The nucleotide sequence corresponds to nucleic acids 
1 to 2847 of SEQ ID NO:10. The amino acid sequence corresponds to amino acids 1 to 638 
of SEQ ID NO: l 1 . The coding region without the 5' or 3' untranslated regions of the 
human 53763 gene is shown in SEQ ID NO: 12. 

Figure 1 7 depicts a structural, hydrophobicity, and antigenicity analysis of the 
35 human 53763 polypeptide (SEQ ID NO:l 1). The locations of the 6 transmembrane 
domains, as well as the pore domain (P), are indicated. 

Figure 18A-B depicts the results of a search in the HMM database, using the amino 
acid sequence of human 53763. 

10 
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Figure 19 depicts a Clustal W (1.74) sequence alignment of the human 53763 amino 
acid sequence (Fbh53763pat; S&Q ID NO: 11) and the amino acid sequence of the Rattus 
norvegicus voltage-gated potassium channel protein KV3.2 (KSHIII A) (ratCIKE; SEQ ID 
NO:33; GenBank Accession No. P22462).' 'Amino acid identities are indicated by stars. 
5 The six transmembrane domains (TMl , TM2, etc.) are boxed. The pore domain, which 
contains the potassium channel signature sequence motif, is also boxed. Plus signs (+) at 
every third position of the fourth transmembrane domain' (TM4), indicate the positively 
charged residues of the voltage sensor. 

Figures 20A-E depicts the cDNA sequence and predicted amino acid sequence of 
10 human 67076. The nucleotide sequence corresponds to nucleic acids 1 to 6582 of SEQ ID 
NO: 13. The aminoacid sequence corresponds to amino acids 1 to 1 129 of SEQ ID NO: 14. 
The codiiig region without the 5' and 3' untranslated regions of the human 67076 gene; is 
shown in SEQ ID NO: 15. t . 

Figure 21 depicts a structural, hydrophobicity, and antigenicity analysis of the 
15 human 67076 polypeptide (SEQ ID NO: 14). - 
Figure 22 depicts the results of a search in the HMM database, using* the amino acid 
sequence of human 67076. 

Figures 23 depicts a Clustal W (1 .74) alignment of the human 67076 amino acid 
sequence ("Fbh67076FL"; SEQ ID NO: 14) with the amino acid sequence of mouse 
20 Potential Phospholipid-Transporting ATPase IH (mouseATl H) (GenBank Accession No. 
P98197) (SEQ ID NO:34). The transmembrane domains ("TMl", "TM2", etc.), E1-E2 
ATPases phosphorylation site ("phosphorylation site"), and phospholipid transporter 
specific amino acid residues ("phospholipid transport") are boxed. 

Figures 24A-E depicts the cDNA sequence and predicted amino acid sequence of 
25 human 67102. The nucleotide sequence corresponds to nucleic acids 1 to 6074 of SEQ ID 
NO: 16. The amino acid sequence corresponds to amino acids 1 to 1426 of SEQ ID NO: 17. 
The coding region without the 5' and 3 ' untranslated regions of the human 67102 gene is 
shown in SEQ ID NO: 18. 

Figure 25 depicts a structural, hydrophobicity, and antigenicity analysis of the 
30 human 67102 polypeptide (SEQ ID NO:17). 

Figure 26A-B depicts the results of a search in the HMM database, using the amino 
acid sequence of human 67102. 

Figures 27A-B depicts a Clustal W (1 .74) alignment of the human 67102 amino acid 
sequence ("Fbh67102FL"; SEQ ID NO:17) with the amino acid sequence of mouse 
35 Potential Phospholipid-Transporting ATPase VA (mouseATSA) (GenBank Accession No. 
054827) (SEQ ID NO:35). The transmembrane domains ("TMl", "TM2", etc.), E1-E2 
ATPases phosphorylation site ("phosphorylation site"), and phospholipid transporter 
specific amino acid residues ("phospholipid transport") are boxed. 

11 
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Figures 28A-E depicts the cDNA sequence and predicted amino acid sequence of 
human 44181. The nucleotide sequence . corresponds to nucleic acids i to 7221 of SEQ ID • 
N0.19. The amino acid sequence corresponds to amino acids l to 1177 of SEQ IDNO 20 
The coding region without the 5' and 3' untranslated regions ofthe human 441 81 gene'is ' 
5 showninSEQIDNO:21. • 

Figure 29 depicts a structural, hydrophobicity, and antigenicity analysis of the 
human 44181 polypeptide. (SEQ ID NO:20). • . 

Figure 30A-B depicts the results of a search m me Hto database, using the amino 
acid sequence of human 44181. 

10 • ^ Fi8UreS 31A ' B dePiCtS ' a CIUStaI W (L74) mU ^ ple SeqUehce ^ ent ^ the human 
441 81 ammo add sequence ("Fbh44181"; SEQ ID NO:20) with the amino acid sequence of 

mouse PotentiarPhospholipid-Transporting ATPase IH (mouseATlH) (GenBank Accession 
, No.P98197)(SEQIDNO:34)and67076rFbh67076FL''; SEQIDNO-14) The 
transmembrane domains ("TMl", «TM2", etc.), ETE2 ATPases phosphorylation site 
15 (''phosphorylation site"), and phospholipid transporter specific amino acid residues 
("phospholipid transport") are boxed. 

Figures 32A-D depicts the cDNA sequence and predicted amino acid sequence of 
human 67084FL, The nucleotide sequence corresponds to nucleic acids 1 to 4198 of SEQ 
ID NO:22. The amino acid sequence corresponds to amino acids 1 to 1084 of SEQ ID 
20 NO:23. The coding region without the 5' and 3' untranslated regions of the human 
67084FL gene is shown in SEQ ID NO:24. 

Figure 33 depicts a structural, hydrophobicity, and antigenicity analysis of the 
human 67084FL polypeptide (SEQ ID NO:23). 

, Figures 34A-B depicts the results ofa search in the HMM database, using the amino 
25 acid sequence of human 67084FL. 

Figures 35A-B depicts a Clustal W (1 .74) alignment of the human 67084FL amino 
acid sequence ("Fbh67084FL"; SEQ ID NO:23) with the amino acid sequence of mouse 
Potential Phospholipid-Transporting ATPase IIV (mouseAT2B) (GenBank Accession 
No.:P98195)(SEQ'IDNO:36). The transmembrane domains ("TMl", "TM2" etc) E1-E2 
30 ATPases phosphorylation site ("phosphorylation site"), and phospholipid transporter 
specific amino acid residues ("phospholipid transport") are boxed. 

Figures 36A-D depicts the cDNA sequence and predicted amino acid sequence of 
human 67084alt. The nucleotide sequence corresponds to nucleic acids 1 to4231 of SEQ 
ID NO:25. The amino acid sequence corresponds to amino acids 1 to 1095 of SEQ ID 
35 NO:26. The coding region without the 5' and 3' untranslated regions of the human 67084alt 
gene is shown in SEQ ID NO:27. 

Figure 37 depicts a structural, hydrophobicity, and antigenicity analysis of the 
human 67084alt polypeptide (SEQ ID NO:26). 
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. Figures 38A-B depicts the results of a search in the HMM database, using the amino 
acid sequence of human 67084. 

Figures 39A-B depictsia Clustal W (1.74) alignment of the human" 67084alt amino 
acid sequence ("Fbh67084alt"; SEQ ID NO" :26) with the amino acid sequence of mouse 
5 Potential Phospholipid-Transporting ATPase I1V (mouseAT2B) (GenBank Accession 
No.:P98l95) (SEQ ID NO:36). The transmembrane domains ("TM1", "TM2", etc.), E1-E2 
ATPases phosphorylation site ("phosphorylation site"), and phospholipid transporter 
specific amino acid residues (''phospholipid transport") are boxed. ' '.. 

10 Detailed Description of the Invention 

The present invention is based, at least in part, on the discovery of novel sugar 

transporter family molecules, referred to herein as "8099 and 46455" nucleic acid and. 
polypeptide molecules. These novel molecules are capable of, for example, modulating a 
transporter mediated activity (e.g., a sugar transporter mediated activity) in a cell, e.g., a 
15 liver cell, fat ceil, muscle cell, or blood cell, such as an erythrocyte. These novel molecules 
are capable of transporting molecules, e.g. , hexoses such as D-glucose, D-fructose, D- 
galactose or mannose across biological membranes and, thus, play a role in or function in a 
variety of cellular processes, e.g. , maintenance of sugar homeostasis. Thus the 8099 and 
46455 molecules of the present invention provide novel diagnostic targets and therapeutic 
20 agents to control 8099 and 46455- associated disorders, as defined herein. 

The present invention is also based, at least in part, on the discovery of novel 
potassium channel family members, referred to herein as "54414 and 53763" nucleic acid 
and polypeptide molecules. These novel molecules are capable of, for example, modulating 
PCH mediated activities in a cell, e.g., a neuronal cell. Thus, the 54414 and 53763 
molecules of the present invention provide novel diagnostic targets and therapeutic agents to 
control 54414 or 53763 -associated disorders, as defined herein. 

The present invention also is based, at least in part, on the discovery of novel 
phospholipid transporter family molecules, referred to herein as "67076, 67102, 44181, 
67084FL, or 67084alt" nucleic acid and polypeptide molecules. These novel molecules are 
30 capable of, for example, transporting phospholipids (e.g., aminophospholipids such as 
phosphatidylserine and phosphatidylethanolamine, choline phospholipids such as 
phosphatidylcholine and sphingomyelin, and bile acids) across cellular membranes and, 
thus, play a role in or function in a variety of cellular processes, e.g., phospholipid transport, 
absorption, secretion, gene expression, intra- or inter-cellular signaling, and/or cellular 
35 proliferation, growth, and/or differentiation. Thus, the 67076, 67102, 44181, 67084FL, and 
67084alt molecules of the present invention provide novel diagnostic targets and therapeutic 
agents to control 67076, 67102, 44181, 67084FL, or 67084alt -associated disorders, as 
defined herein. 
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The term "family" when referring to the protein and nucleic acid molecules tif the 
invention is intended to mean two or more proteins or nucleic acid molecules having a 
common structural domain or motif and having sufficient amino acid or.micleotide sequence 
homology as defined herein. Such family members can be naturally or non-naturally 
5 occurring and can be from either the same or different species. For example, a family can 
contain a first protein of human origin as well as other distinct proteins of human origin of 
alternatively, can contain homologues of non-human origin, e.g. , rat or mouse proteins. 
Members of a family can also have common functional characteristics. • 

' t. 

0 8099 and 46455 Molecules of the Invent ion 

The family 6f 8099 and 46455 polypeptides comprise at least one "transmembrane 
domain" and at least one, preferably two, three, four, five, six, seven, eight, nine, ten, . 
eleven, or twelve transmembrane domains. As used herein, the term "transmembrane 
domain" includes an amino acid sequence of about 20-45 amino acid residues in length 

5 which spans the plasma membrane. More preferably, a transmembrane domain includes .. 
about at least 20, 25, 30, 35, 40, or 45 amino acid residues and'spans the plasma membrane. 
Transmembrane domains are rich in hydrophobic residues, and typically have an alpha- 
helical structure. In a preferred embodiment, at least 50%, 60%, 70%, 80%, 90%, 95% or 
more of the amino acids of a transmembrane domain are hydrophobic, e.g. , leucines, 
isoleucines, alanines, valines, phenylalanines, prolines or methionines. Transmembrane 
domains are described in, for example, Zagotta W.N. et al, (1996) Annual Rev. Neurosci. 
19: 235-263, the contents of which are incorporated herein by reference. A MEMSAT and 
additional analyses resulted in the identification of twelve transmembrane domains in the 
amino acid sequence of human 8099 (SEQ IDNO:2) at about residues 32-49, 81-101, 109- 
130, 138-156, 165-184, 198-217, 279.-301, 315-338, 346-364, 463-487, 499-521, and529- 
549. A MEMSAT and additional analyses resulted in the identification of twelve 
transmembrane domains in the amino acid sequence of human 46455 (SEQ ID NO:5) at 
about residues 58-74, 98-1 18, 126-145, 165-181, 188-205, 218-238, 273-294, 323-341, 357- 
377, 386-410, 423-441, and 462-485. 

Accordingly, 8099 and 46455 polypeptides having at least 50-60% homology, 
preferably about 60-70%, more preferably about 70-80%, or about 80-90% homology with 
at least one, preferably at least two, three, four, five, six, seven, eight, nine, ten, eleven, or 
twelve transmembrane domains of human 8099 and 46455, respectively are within the scope 
of the invention. 

Another embodiment of the invention features 8099 molecules which contain an N- 
terminal unique domain. The term "unique N-terminal domain" as used herein, refers to a 
protein domain of an 8099 protein. family member which includes amino acid residues N- 
terminal to the sixth transmembrane domain, e.g., the GLUT8-like domain in the amino acid 
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sequence of the 8099 protein. As used herein, a "unique N-terminal domain" refers to a 
protein domain which is at least about 1 50-200 amino acid residues in length, preferably at , 
least about 160-190 amino acid residues in length and shares, significantly more sequence 
homology with about residues 1 to 178 of SEQ IDNO:2 than with about residues 1 to 178 
5 of GLUT8. 

Accordingly, 8099 polypeptides having at least 50-60% homology, preferably about 
60-70%, more preferably eibout 70-80%, or about 80-90% homology with at least one 
unique N-terminal domain of human 8099 (e.g., about amino acids 1-178 of SEQ ID NO:2) 
are within the scope of the invention. . 
10 Yet another aspect of the invention features 8099 proteins having an "extended 

, exofacial loop" between transmembrane domains 9 and 1 0. Preferably, the first amino acid 
residue of an extended exofacial loop of 8099 is the first residue C-terminal to the amino 
acid residues of transmembrane domain 9 and the last residue of the exofacial loop is the 
' first residue N-terminal to the amino acid residues of transmembrane domain 10 of 8099. In 
15 a preferred embodiment, an extended exofacial loop is at least about 50-55, 55-60, 60-65, 
65-70, 70-75, 75-80, 80-85, 85-90, 90-95, 97 or more amino acid residues in length. For 
example, in one embodiment; an 8099 protein includes an "extended exofacial loop" of 
about amino acids 365-462 of SEQ ID NO:2 (97 amino acid residues in length). 

Accordingly, 8099 polypeptides having at least 50-60% homology, preferably about 
20 60-70%, more preferably about 70-80%, or about 80-90% homology with at least one 
extended exofacial loop of human 8099 are within the scope of the invention. 

In another embodiment, an 8099 and/or 4645 5 molecule of the present invention is 
identified based on the presence of at least one "sugar transporter family domain." As used 
herein, the term "sugar transporter family domain" includes a protein domain having at least 
25 about 300-600 amino acid residues and a sugar transporter mediated activity. Preferably, a 
sugar transporter family domain includes a polypeptide:having an amino acid sequence of 
, about 350-550, 400-550, or more preferably, about 41 1 or 521 amino acid residues and a 
sugar transporter mediated activity. To identify the presence of a sugar transporter family 
domain in an 8099 and/or an 46455 protein, and make the determination that a protein of 
30 interest has a particular profile, the amino acid sequence of the protein may be searched 
against a database of known protein domains (e.g., the PFAM HMM database). A PFAM 
' sugar transporter family domain has been assigned the PF AM Accession PF00083. A 
search was performed against the PFAM HMM database resulting in the identification of a 
sugar transporter family domain in the amino acid sequence of human 8099 (SEQ ID NO:2) 
35 at about residues 43-564 of SEQ ID NO:2. A search was performed against the PFAM 
HMM database resulting in the identification of a sugar transporter family domain in the 
amino acid sequence of human 46455 (SEQ ID NO:5) at about residues 58-487 of SEQ ID 
NO:5. 

15 
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Preferably a "sugar transporter family domain'Vhas a "sugar transporter mediated 
activity" as described herein. For example, a sugar transporter family domain may have the- 
ability to bind a monosaccharide (e.g.- D-gliicose, D-fructose, D^galactose and/or mannose)- 
the ability to transport a monosaccharide (e.g., D-glucqse, D-fructose, D-galactose and/or ' 
mannose) in a constitutive manner or in response to stimuli (e.g., insulin) across a cell 
membrane (e.g., a liver cell membrane, fat cell membrane, muscle cell membrane, and/or 
blood cell membrane, such as an erythrocyte membrane); the ability to function as a 
neuronal transporter; the ability to mediate trans-epithelial mbvement; and/or the ability to 
modulate sugar homeostasis in a cell. Accordingly, identifying the presence of a "sugar 
transporter family domain" can include isolating a fragment of an 8099 and/or an 46455 
' molecule (e.g., an 8099 and/or an. 46455 polypeptide) and assaying for the ability of the ' 
fragment to exhibit one of the aforementioned sugar transporter mediated activities. 

A description of the Pfam database can be found in Sonhammer etal. (1997) 
Proteins 28:405-420 and a detailed description of HMMs can be found, for example in 
Gnbskov et al.{\990) Meth. Enzymol. 183:146-159; Gribskove/tf/.(1987) Proc. Natl Acad. 
Sci. USA 84:4355-4358; Krogh et al. ( 1 994) J. Mol. Biol. 235:1501-1531; and Stultzer 
a/.(1993) Protein Sci. 2:305-314, the contents of which are, incorporated herein by reference. 

In a preferred embodiment, the 8099 and/or 46455 molecules of the invention 
include at least one, preferably two, even more preferably at least three, four, five, six, 
seven,.eight, nine, ten, eleven, or twelve transmembrane domain(s) and/or at'least'one'sugar 
transporter family domain. In another preferred embodiment, the 8099 molecules of the 
invention include at least one, preferably two, even more preferably at least three, four, five, 
six, seven, eight, nine, ten, eleven, or twelve transmembrane domain(s), at least one sugar ' 

, " y a &st one unique N-terminal domain, and/or at least one 

extended exofacial loop. 

Isolated polypeptides of the present invention, preferably 8099 or 46455 
polypeptides, have an amino acid sequence sufficiently identical to the amino acid sequence 
of SEQ ID NO:2 or 5 or are encoded by a nucleotide sequence sufficiently identical to SEQ 
ID NO:l, 3, 4 or 6. As used herein, the term "sufficiently identical" refers to a first amino 
acid or nucleotide sequence which contains a sufficient or minimum number of identical or 
equivalent (e.g. , an amino acid residue which has a similar side chain) amino acid residues 
or nucleotides to a second amino acid or nucleotide sequence such that the first and second 
amino acid or nucleotide sequences share common structural domains or motifs and/or a 
common functional activity. For example, amino acid or nucleotide sequences which share 
common structural domains having at least 50%, 55%, 60%, 65%, 70%, 75% 80% 85% 
85%, 90%, 95%, 96%, 97%, 98%, 99% or more homology or identity across the amino acid 
sequences of the domains and contain at least one and preferably two structural domains or 
motifs, are defined herein as sufficiently identical. Furthermore, amino acid or nucleotide 
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sequences which share at least 50%, 55%, 60%, 65%, 70%, 75%, ^0%, 85%, 85%, 90%, 
95%, 96%, 97%, 98%; "99% or more homology or identity and share a common functional 
activity are defined herein as sufficiently identical. • ' 

In a preferred embodiment, an 8099' and/or 46455 polypeptide includes at least one 

5 or more of the following domains : a transmembrane domain and/or a sugar transporter 
family domain, and has an amino acid sequence at least about 50%, 55%, 60%, 65%, 70%, 
75%, 80%, 85%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more homologous' or identical 
to the amino acid sequence of SEQ ID NO:2 or 5, or 'the amino acid sequences encoded ;by 

the DNA inserts of the plasmids deposited with ATCC as Accession Numbers _ and/or 

10 . in yet another preferred embodiment, an 8099 and/or an 46455 polypeptide includes 

at least one or more' of the following domains: a transmembrane domain and/or a sugar 
transporter family domain, and is encoded by a nucleic acid molecule having a nucleotide 
sequence which hybridizes under stringent hybridization conditions to a complement of a 
nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:3, 
15 SEQ ID NO:4 or SEQ ID NO:6. In another preferred embodiment, an 8099 and/or an 46455 
polypeptide includes at least one or more of the following domains: a transmembrane 
domain and/or a sugar transporter family domain, and has an 8099 and/or an 46455 activity. 

As used interchangeably herein, an "8099 activity", "46455 activity", "biological 
activity of 8099", "biological activity of 46455", "functional activity of 8099" or "functional 
20 activity of 46455" refers to an activity exerted by an 8099 and/or 46455. polypeptide or 
nucleic acid molecule on an 8099 and/or 46455 responsive cell or tissue, or on an 8099 
and/or 46455 polypeptide substrate, as determined in vivo, or in vitro, according to standard 
techniques. In one embodiment, an 8099 and/or 46455 activity is a direct activity, such as 
an association with an 8099- and/or 46455-target molecule. As used herein, a "substrate," 
25 "target molecule," or "binding partner" is a molecule with which an 8099 and/or 46455 
polypeptide binds or interacts in nature, such that 8099- and/or 46455-mediated function is 
achieved. An 8099 and/or 46455 target molecule can be a non- 8099 and/or a non-46455 
molecule or an 8099 and/or 46455 polypeptide or polypeptide of the present invention. In 
an exemplary embodiment, an 8099 and/br 46455 target molecule is an 8099 and/or 46455 
30 ligand, e.g. , a sugar transporter ligand such D-glucose, D-fructose, D-galactose, and/or 
mannose. Alternatively, an 8099 and/or 46455 activity is an indirect activity, such as a 
cellular signaling activity mediated by interaction of the 8099 and/or 46455 polypeptide 
with an 8099 and/or 46455 ligand. The biological activities of 8099 and/or 46455 are 
described herein. For example, the 8099 and/or 46455 polypeptides of the present invention 
35 can have one or more of the following activities: (1) bind a monosaccharide, e.g., D-glucose, 
D-fructose, D-galactose, and/or mannose, (2) transport monosaccharides across a cell 
membrane, (3) influence insulin and/or glucagon secretion, (4) maintain sugar homeostasis 
in a cell, (5) function as a neuronal transporter, and (6) mediate trans-epithelial movement in 
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a cell.. Moreover, in a preferred embodiment, 8099 and/or 46455 molecules of the present 
invention, 8099 and/olr 46455 antibodies, 8699 and/or 46455 modulators are useful in at 
least one of the following: (l)imodulation of insulin sensitivity; (2) modulation of blood 
sugar levels; (3) treatment of blood sugar level disorders (e.g., diabetes); and/or (4) 
5 modulation of insulin resistance. 

The nucleotide sequence of the isolated human 8099 and 46455 cDNAs and the 
predicted amino acid sequences of the human 8099 and 46455 polypeptides arfe shown in 
Figures 1 and 8 and in SEQ ID NOs: 1 and 2, and SE<Q ID NOs:4 and 5, respectively! ; 
Plasmids containing the nucleotide sequences encoding human 8099 or 46455 were 
deposited With the American Type Culture Collection (ATCC), 10801 University 

Boulevard, Manassas, VA 2011 0-2209, on . and assigned Accession Numbers . 

or . These deposits will be maintained under the terms of the Budapest Treaty on 

the International Recognition of the Deposit of Microorganisms for the Purposes of Patent 
Procedure. These deposits were made merely as a convenience for those of skill in the art 
and are not an admission that a deposit is required under 35 U.S.C. §112. 

The human 8099 gene, which is approximately 2725 nucleotides in length, encodes '< 
polypeptide which is approximately 617 amino acid residues iri length. The human 46455 
gene, which is approximately 2230 nucleotides in length, encodes a polypeptide which is 
approximately 528 amino acid residues in length. 

54414 and 53763 Molecules of the Invention 

The family of 54414 and 53763 proteins of the present invention comprises at least 
one transmembrane domain, preferably at least 2 or 3 transmembrane domains, more 
preferably 4 or 5 transmembrane domains, and most preferably, 6 transmembrane domains. 
Amino acid residues 64-83., 104-127,, 135-153, 161-173, 199-217, and 257-274 of the 
human 54414 protein (SEQ ID NO:8) are predicted to comprise transmembrane domains. 
Amino acid residues 230-248, 287-303, 314-335, 346-368, 382-402, and 451-473 of the 
human 53763 protein (SEQ ID NO:l 1) are predicted to comprise transmembrane domains. 

In another embodiment, members of the 54414 and 53763 family of proteins include 
at least one "ion transport protein domain" in the protein or corresponding nucleic acid 
molecule. As used herein, the term "ion transport protein domain" includes a protein 
domain having at least about 150-310 amino acid residues and a bit score of at least 200 
when compared against an ion transport protein domain Hidden Markov Model (HMM), 
e.gi 9 PFAM Accession Number PF00520. Preferably, an ion transport protein domain 
includes a protein domain having an amino acid sequence of about 170-290, 190-270, 210- 
250, or more preferably about 173 or 191 amino acid residues. To identify the presence of 
an ion transport protein domain in a 54414 or 53763 protein, and make the determination 
that a protein of interest has a particular profile, the amino acid sequence of the protein is 
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searched against a database of known protein motifs and/or domains (e.g. , the HMM 
database). The ion transport protein domain (HMM) has been assigned the PFAM 
Accession number PF00520. A search was performed against the HMM database resulting 
in the identification of an ion transport protein domain in the amino acid sequence of human 
5 54414 at about residues 1 04-277 of SEQ ID NO:8 and in the amino acid sequence of human 
53763 about residues 281-472 of SEQ ID NO: l 1. 

Preferably an ion transport protein domain is at least about 1 50-3 10 amino acid 
residues and has an "ion transport protein domain activity", for example, the ability to 
interact with a 54414 or 53763 substrate or target molecule (e.g., a potassium ion) and/or to 
10 regulate 54414 or 53763 activity. Accordingly, identifying the presence of an "ion transport 
i protein domain" can include isolating a fragment of a 54414 or. 53763 molecule (e.g., a 1 
54414 or 53763 polypeptide) and assaying for the ability of the fragment to exhibit one of 
the aforementionedi ion transport protein domain activities. 

In another embodiment, members of the 54414 and 53763 family of proteins include 
15 at least one "K + channel tetramerisation domain" in the protein or corresponding nucleic 
acid molecule. As used herein, the term "K + channel tetramerisation domain" includes a 

protein domain having at least about 70-230 amino acid residues and a bit score of at least 

». 

80 when compared against a K + channel tetramerisation domain Hidden Markov Model 
(HMM), e.g., PFAM Accession Number PF02214. Preferably, a K + channel tetramerisation 

20 domain includes a protein domain having an amino acid sequence of about 90-21 0, 1 1 0-1 90, 
130-170, or more preferably about 149 amino acid residues, and a bit score of at least 100, 
120, 140, or more preferably, 1 56.7. To identify the presence of a K + channel 
tetramerisation domain in a 54414 or 53763 protein, and make the determination that a 
protein of interest has a particular profile, the amino acid sequence of the protein is searched 

25 against a database of known protein motifs and/or domains (e.g., the HMM database). The 
K + channel tetramerisation domain (HMM) has been assigned the PFAM Accession number 
. PF02214. A search was performed against the HMM database resulting in the identification 
of a K + channel tetramerisation domain in the amino acid sequence of human 53763 at about 
residues 8-1 56 of SEQ ID NO: 1 1 . 

30 Preferably a K + channel tetramerisation domain is at least about 70-230 amino acid 

residues and has an "K + channel tetramerisation domain activity", for example, the ability to 
interact with one or more potassium channel subunits (e.g., 54414 or 53763 molecules, or 
non-,5441 4 or 53763 potassium channel subunits), the ability to regulate assembly of a 
54414 or 53763 molecule into a potassium channel tetramer, and/or to regulate 54414 or 

35 53763 activity. Accordingly, identifying the presence of an "K + channel tetramerisation 
domain" can include isolating a fragment of a 54414 or 53763 molecule (e.g., a 54414 or 
53763 polypeptide) and assaying for the ability of the fragment to exhibit one of the 
aforementioned K + channel tetramerisation domain activities. 
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In Another embodiment, a 544 1 4 or 53763 protein of the present invention is K 
identified based on th£ presenceof an "ATP7GTP-binding sit motif A (P-loop) motif, 
referred to alternatively herein as a "P-loop motif, in the protein or corresponding nucleic 
acid molecule. Preferably, a P-loop motif includes a protein motif which is about 4-15, 5- 
5 13,6-11, 7-9, or preferably about 8 amino acid residues. The P-loop motif functions in 

binding ATP and/or GTP via interaction with the phosphate groups of the nucleotide and has 
been assigned Prosite™ Accession Number PS000 17. To identify the presence of a P-loop 
motif in a 544 1 4 or 53763 protein, and to make the determination that a protein of interest 
has a particular profile, the amino acid sequence of the protein may be searched against a 

10 database of known protein domains or motifs (e. g. , the Prosite™ database) using the default 
parameters (available at the ProSite website). A search was performed against the ProSite 
database resulting in the identification of a P-loop motif in the amino acid sequence of 
human 54414 (SEQ ID NO:8) at about residues 1007-1014. ' ■ ■ ; 
_ In another embodiment, a 544 1 4 or 53763 protein of the present invention is 

1 5 identified based oh the presence of a "pore domain", alternatively referred to herein as a "P- 
region domain", in the protein or corresponding nucl eic acid molecule. As iised 
interchangeably herein, the terms "pore .domain" and "P-regiori domain" include a protein 
domain having about 10-30, 12-28, 13-25, 14-24, 15-23, or preferably about 16-22 amino 
acid residues, which is involved in lining the potassium channel pore. A pore domain is 

20 typically found between transmembrane domains of potassium channels and is believed to 
be a major determinant of ion selectivity in potassium channels. Preferably, a pore domain 
includes a potassium channel signature motif, as defined herein. Pore domains are described 
in, for example, Warmke et al. (1991) Science 252:1560-1562; Zagotta W.N. et al. (1996) 
Annu. Rev, Neuroscl 19:235-63; Pongs, O. (1993) J. Membr. Biol 136:1-8; Heginbotham et 

25 al. (1994) Biophys. J. 66:1061-1067; .Mackinnon, R. (1995) Neuron 14:889-892; and 

Pascual et al. (1 995) Neuron 14: 1 055- 1 063), the contents of which are incorporated herein 
by reference. A pore domain was identified in the amino acid sequence of human 54414 at 
about residues 229-250 of SEQ ID NO:8. A pore domain was identified in the amino acid 
sequence of human 53763 at about residues 426-441 of SEQ ID NO:l 1 . 

30 In a further embodiment, a 544 1 4 or 53763 protein of the present invention is 

identified based on the presence of a "potassium channel signature sequence motif in the 
protein or corresponding nucleic acid molecule. As used herein, the term "potassium 
channel signature sequence motif includes a protein motif which is diagnostic for 
potassium channels. Preferably, a potassium channel signature sequence motif has the 

35 consensus sequence T-X-X-T-X-G-hydrophobic-G (see Joiner, W. J. et al. (1 998) Nat 
Neuroscl 1:462-469 and references cited therein), wherein "X" indicates any amino acid 
residue, and "hydrophobic" indicates any hydrophobic amino acid residue. Preferably, a 
potassium channel signature sequence motif is included within a pore domain and includes 
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at least 1, 2, 3, 4, 5, 6, 7, or more preferably, 8 amino acid residues that match the consensus 
sequence for a potassium channel signature sequence motif. A potassium channel signature 
sequence motif was identified in the gmino acid sequence of human 54414 at about residues 
239-246 of SEQ ID NO:8. A potassium channel signature sequence motif was identified, in 

5 the amino acid sequence of human 53763 at about residues 436-441 of SEQ ID-NO: 1 1 . 

In still another embodiment, a 54414 or 53763 protein of the present invention is 
identified based on the presence of a "voltage sensor motif V alternatively referred to simply 
as a "voltage sensor", in tlie protein or the corresponding nucleic acid molecule. As used 
interchangeably herein, the terms "voltage sensor motif and "voltage sensor" include a 

10 protein motif having about 10 T 30, 1 1-26, 12-24, 13-22, 14-20, 15-18, or more preferably 16 
, amino acid residues, which is involved in sensing voltage differences between the two sides 
of the plasma membrane of a cell. Preferably, a voltage sensor motif includes at least 1 , 2, 
3, 4, 5, or more preferably, 6 positively charged amino acid residues, which are preferably 
' spaced apart by at least 1, or preferably 2, non-positively charged amino acid residues. 

15 Preferably, a voltage sensor motif is included within and/or overlaps with a transmembrane 
domain, more preferably the fourth transmembrane, of the 54414 or 53763 protein in which 
it is found. A voltage sensor motif was identified in the amino acid sequence of human 
53763 at about residues 348-363 of SEQ ID NO:8. The positively charged amino acid 
residues of the human 53763 voltage sensor were identified at about residues 348, 351, 354, 

20 357, 360, and 363 of SEQ ID NO:8. No voltage sensor was identified in human 54414. 

Isolated proteins of the present invention, preferably 54414 or 53763 proteins, have 
an amino acid sequence sufficiently homologous to the amino acid sequence of SEQ ID 
NO: 8 or SEQ ID NO: 11, or are encoded by a nucleotide sequence sufficiently homologous 
to SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, or SEQ ID NO:12. Amino acid or 

25 nucleotide sequences which share common structural domains having at least 50%, 55%, 
60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%, 95%, 96%, 97%* 98%, 99% or more 
homology or identity across the amino acid sequences of the domains arid contain at least 
one and preferably two structural domains or motifs, are defined herein as sufficiently 
homologous. Furthermore, amino acid or nucleotide sequences which share at least 50%, 

30 55%, 60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more 
homology or identity and share a common functional activity are defined herein as 
' sufficiently homologous. 

In a preferred embodiment, a 54414 or 53763 protein includes at least one or more of 
the following domains or motifs: a transmembrane domain, an ion transport protein 

35 domain, a K + channel tetramerisation domain, a P-loop motif, a pore domain, a potassium 
channel signature sequence motif, and/or a voltage sensor motif, and has an amino acid 
sequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%, 95%, 
96%, 97%, 98%, 99% or more homologous or identical to the amino acid sequence of SEQ 
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ID NO:8 or 1 1, or the amino acid sequence encoded by the DNA insert of the plasmid 
- depositedwithATCCasAc<^ion.Number___or__. In yet another preferred 
embodiment, a 54414 or 53763 protein includes at least one or more of the following 
domains or motifs: a transmembrane domain, an ion transport protein domain, a K + channel 
5 tetramerisation domain, a P-loop motif, a pore domain, a potassium channel signature 
sequence motif, and/or a voltage sensor motif, and is encoded by a nucleic acid molecule 
having a nucleotide sequence which hybridizes under stringent hybridization conditions to a 
complement of a nucleic acid molecule comprising the nucleotide sequence of SEQ ID 
NO:7, 9, 10, or 12. In another preferred embodiment, a 54414 or 53763 protein includes at 
10 least one or more of the following domains or motifs: a transmembrane domain, an ion 
• transport protein domain, a K + channel tetramerisation domain, a P-loop motif, a pore ' 
domain, a potassium channel signature sequence motif, and/or a voltage sensor motif, and 
has a 54414 or 53763 activity. 

As used interchangeably herein, a "544 1 4 or 53763 activity", "biological activity of 

15 54414 or 53763" or "functional activity of 54414 or 53763", includes an activity exerted or 
mediated by a 54414 or 53763 protein, polypeptide or nucleic acid molecule when 
expressed in a cell or on a membrane, as determined in v/vy or in vitro, according to 
standard techniques. In one embodiment, a 54414 or 53763 activity is a direct activity, such 
as transport of a 54414 or 53763 substrate (e.g., a potassium ion). In another embodiment, a 

20 54414 or 53763 activity is an indirect activity mediated, for example, by interaction of a 
54414 or 53763 molecule with a 54414 or 53763 target molecule or binding partner. As 
used herein, a "target molecule" or "binding partner" is a molecule with which a 54414 or 
53763 protein binds or interacts in nature, such that function of the target molecule or 
binding partner is modulated. In an exemplary embodiment, a 54414 or 53763 target 

25 molecule or binding partner is a 54414 or 53763 polypeptide or a non-54414 or 53763 
potassium channel subunit. 

In a preferred embodiment, a 54414 or 53763 activity is at least one of the following 
activities: (i) interaction with a 54414 or 53763 substrate (e.g., a potassium ion or acyclic 
nucleotide); (ii) conductance or transport of a 54414 or 53763 substrate aqross a cellular 

30 membrane; (Hi) interaction with a second protein (e.g., a second 54414 or 53763 subunit or 
a non-54414 or 53763 potassium channel subunit); (iv) modulation (e.g., maintenance 
and/or rectifacation) of membrane potentials; (v) regulation of target molecule availability or 
activity; (vi) modulation of intra- or intercellular signaling and/or gene transcription (e.g. , 
either directly or indirectly); (viii) generation of outwardly rectifying currents; (viii) 

35 modulation of membrane excitability; (ix) modulation of the release of neurotransmitters; 
(x) regulation of contractility (e.g., of smooth muscle cells), secretion, and/or synaptic 
transmission; and/or (xi) modulation of processes which underlie learning and memory . 
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Preferred activities of 54414 further include at least one of the following activities: 
(i) interaction with md*i-K potassium channels (i:e., large conductance channels, in- 
particular Slp)\ (ii) modulation of maxi-K potassium channel activity (e.gA iS/o-mediated 
activities); (iii) generation of intermediate conductance channels; and/or (iv) regulation of 
5 contractility (e.g., of smooth muscle cells), secretion, and/or synaptic transmission, in 
particular, via modulation of SIo. 

Preferred activities of 53763 further include at least one of the following activities: 
(i) interaction with Shaker (Sh) potassiubi channels and/or channel stibunits; (ii) inoculation 
of Shaker (Sh) potassium channel activity (e.g., termination of prolonged membrane 
10 depolarization; (iii) modulation of high voltage activating channel activity and/or 
inactivating channel activity, and the like. 

The nucleotide sequence of the isolated human 54414 cDNA and the predicted, 
amino acid .sequence encoded by the 54414 cDNA are shown in Figures 12A-C and in SEQ 
ID NOs:7 and 8, respectively. A plasmid containing the human 54414 cDNA was deposited 
15 with the American Type Culture Collection (ATCC), 10801 University Boulevard, 

Manassas, V A 2011 0-2209, on and assigned Accession kumber - - 1 . This 

deposit will be maintained under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. This 
deposit were made merely as a convenience for those of skill in the art and is not an 
20 admission that a deposit is required under 35 U.S.Q. § 1 12. 

The human 54414 gene, which is approximately 4632 nucleotides in length, encodes 
a protein having a molecular weight of approximately 123 kD and which is approximately 
1118 amino acid residues in length. 

The nucleotide sequence of the isolated human 53763 cDNA and the predicted 
25 amino acid sequence encoded by the 53763 cDNA are shown in Figures 16A-C and in SEQ 
ID NOs: 1 0 and 1 1 , respectively. A plasmid containing the human 53763 cDNA was 
deposited with the American Type Culture Collection (ATCC), 1 0801 University 

Boulevard, Manassas, VA 201 10-2209, on and assigned Accession Number . 

This deposit will be maintained under the terms of the Budapest Treaty on the International 
30 Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. This 
deposit were made merely as a convenience for those of skill in the art and is not an 
admission that a deposit is required under 35 U.S.C. §112. 

The human 53763 gene, which is approximately 2847 nucleotides in length, encodes 
a protein having a molecular weight of approximately 70.2 kD and which is approximately 
35 638 amino acid residues in length. 
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67076, 67102, 44181, 67084FL, and 67084alt Molecules of the Invention 

The 67076, 67V02, 44181, 67084FL; and 67084alt polypeptides comprise at least . 

one "transmembrane domain"iand preferably eight, nine, or ten transmembrane domains. A 

MEMSAT analysis and a structural, hydrophobicity, and antigenicity analysis also resulted 

in the identification of ten transmembrane domains in the amino acid sequence of human 

67076 (SEQ ID NO:14) at about residues 57-77, 84-105, 292-313, 345-365, 863-883, 905- 

926,956-977,989-1009, 1021-1041, and IO6O-IO87. A MEMS AT analysis arid a 

structural, hydrophobicity, and antigenicity analysis resulted in the identification often ',. 

transmembrane domains in the amino acid sequence of human 67102 (SEQ ID NO: 17) at 

about residues 98-1 15,' 122-140, 322-344, 366-390, 582-601, 752-770, 1145-1166, 1225- 

1246, 1253-1276, arid 1298-1317. A MEMSAT analysis and a structural, hydrophobicity, 

and antigenicity- analysis resulted in the identification often transmembrane domains in the 

amino acid sequence of human 441 81 (SEQ ID NO:20) at about residues 56-72, 87-103, 
290-31 1, 343-363, 878-898, 91 1-931, 961-982, 995-1015, 1027-1047, and 1062-1086. A 

15 MEMSAT analysis and a structural, hydrophobicity, and antigenicity analysis resulted in-the 
identification often transmembrane domains in the amino acid 'sequence of human 67084FL 
(SEQ ID NO:23) at about residues 104-120, 124-144, 331-350,'357-374, 887-903, 912-931, 
961-983, 990-1008, 1015-1035, and 1043-1067. A MEMSAT analysis and a structural, 
hydrophobicity, and antigenicity analysis resulted in the identification of ten transmembrane 

0 domains in the amino acid sequence of human 670S4alt (SEQ ID NO:26) at about residues 
104-120, 124-144, 331-350, 357-379, 887-903, 912-931, 961-983, 990-1008, 1015-1035, 
and 1054-1078. 

The family of 67076, 67102, 44181, 67084FL, or 67084alt proteins of the present 
invention also comprises at least one "extramembrane domain" in the protein or 

5 corresponding nucleic acid molecule.. As used herein, an "extramembrane domain" includes 
a domain having greater than 20 amino acid residues that is found between transmembrane 
domains, preferably on the cytoplasmic side of the plasma membrane, and does not span or 
traverse the plasma membrane; An extramembrane domain preferably includes at least one, 
two, three, four or more motifs or consensus sequences characteristic of P-type ATPases, 

0 i.e., includes one, two, three, four, or more "P-type ATPase consensus sequences or motifs". 
As used herein, the phrase "P-type ATPase consensus sequences or motifs" includes any 
consensus sequence or motif known in the art to be characteristic of P-type ATPases, 
including, but not limited to, the P-type ATPase sequence 1 motif (as defined herein), the P- 
type ATPase sequence 2 motif (as defined herein), the P-type ATPase sequence 3 motif (as 

5 defined herein), and the El -E2 ATPases phosphorylation site (as defined herein). 

In one embodiment, the family of 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt 
proteins of the present invention comprises at least one "N-terminal" large extramembrane 
domain in the protein or corresponding nucleic acid molecule. As used herein, an "N- 
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terminal" large extramembrane domain is found in the N-terminal 1 /3 rd of the protein, 
preferably between the second and third transmembrane domains of a 67076, 67102, 44181, 
67084FL, or 67084alt protein and includes about 60-300, 80,-280, 100-260, 120-240, 140- 
220, 160-200, or preferably, 1 80, 1 85, or 1 86 amino acid residues. In a preferred 
5 embodiment, an N-terminal large extramembrane domain includes at least one P-type 
ATPase sequence 1 motif (as described herein). An N-terminal large extramembrane 
domain was identified in the amino acid sequence of human 67Q76 at about residues 106- 
291 of SEQ ID NO:14. An N-terminal large extramembrane domain was identified in the 
amino acid sequence ofhuman 67102 at about residues 141-321 of SEQ ID NO; 17, An N- 
10 terminal large extramembrane domain was identified in; the amino acid sequence ofhuman 
i 44 1 81 at about residues 1 04-289 of SEQ ID NO:20l An N-terminal large extramembrane 1 
domain was identified in the amino acid sequence of human 67084FL at about residues 145- 
330 of SEQ ID NO:23 . An N-terminal large extramembrane domain was identified in the 
' amino acid sequence ofhuman 67087alt at about residues 145-330 of SEQ ID NO:26. 
15 The family of 67076, 67102, 44181, 67084FL, or 67084alt proteins of the present- 

invention also comprises at least one "C-terminaT large extramembrane domain in the 
protein or corresponding nucleic acid molecule. As used herein, a "C-terminal" large 
extramembrane domain is found in the C-terminal 2/3 rds of the protein, preferably between 
the fourth and fifth transmembrane domains of a 67076, 67 1 02, 44181, 67084FL, or 
20 67084alt protein and includes about 150-1000, 300-900, 370-850, 400-820, 430-790, 460- 
760, 430-730, 460-700, 430-670, 460-640, 430-610, 490-580, 510-550, or preferably, 190, 
506, or 523 amino acid residues. In a preferred embodiment, a C-terminal large 
extramembrane domain includes at least one or more of the following motifs: a P-type 
ATPase sequence 2 motif (as described herein), a P-type ATPase sequence 3 motif (as 
25 defined herein), and/or an E1-E2 ATPases phosphorylation site (as defined herein). A C- 
terminal large extramembrane domain was identified iel the amino acid sequence ofhuman 
67076 at about residues 366-862 of SEQ ID NO: 14. A C-terminai large extramembrane 
domain was identified in the amino acid sequence of human 67 1 02 at about residues 39 1 - 
581 of SEQ ID NO;17. A C-terminal large extramembrane domain was identified in the 
30 amino acid sequence of human 44 1 8 1 at about residues 364-877 of SEQ ID NO:20. A C- 
terminal large extramembrane domain was identified in the amino acid sequence ofhuman 
' 67084FL at about residues 380-886 of SEQ ID NO:23. A C-terminal large extramembrane 
domain was identified in the amino acid sequence ofhuman 67084alt at about residues 380- 
886 of SEQIDNO:26. 

35 In another embodiment, a 67076, 67102, 44181, 67084FL, or 67084alt protein or 

67076, 67102, 44181, 67084FL, or 67084alt extramembrane domain is characterized by at 
least one "P-type ATPase sequence 1 motif' in the protein or corresponding nucleic acid 
sequence. As used herein, a "P-type ATPase sequence 1 motif is a conserved sequence 

25 
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motif diagnostic for P-type ATPases (Tang, X et al. (1996) Science 272:1495-1497- Fagan 
M. J. and Saier, M. H. (1994) J. Mol EvoL 38:57). 'Amino acid residues of the P-type ' 
ATPase sequence 1 motif are involved in the coupling of ATP hydrolysis with transport 
(e.g., transport of phospholipids). The consensus sequence for a P-type ATPase sequence 1 
motif is PNSi-[QENR]-[SA]-[LIVSAN]-[LIV]-[TSN]-G-E-[SN] (SEQ IDNO:37) The 
use of amino acids in brackets indicates that the amino acid at the indicated position may be 
any one of the amino acids within the brackets; e.g., [SA] indicates any of one of either S 
(serine) or A (alanine). In a preferred embodiment, a P-type ATPase sequence 1 motif is 
contained within an N-terminal large extramembrane domain. In another preferred 
embodiment, a P-type ATPase 1 sequence 1 motif in the 67076, 67102, 44181, 67084FL or 
' 67084alt proteins ofthe present invention has at least 1,2, 3, or preferably 4 amino acid ' 
resides which match the consensus sequence for a P-type ATPase sequence 1 motif. A P- 
type ATPase sequence 1 motif was identified in the amino acid sequence of human 67076 at 
aboutresidues 173-181 ofSEQ ID NO:14. A P-type ATPase sequence I motif was 
identified in the amino acid sequence of human 67102 at about residues 208-216 of SEQ JD 
NO:17. A P-type ATPase sequence 1 motif was identified in the amino acid sequence of 
human44181 at about residues 173-181 of SEQ ID NO:20, A P-type ATPase sequence 1 
motif was identified in the amino acid sequence of human 67084FL at about residues 213 
221 ofSEQ ID NO:23. A P-type ATPase sequence 1 motif was identified in the amino acid 
sequence of human 67084alt at about residues 213-221 of SEQ ID NO:26. 

In another embodiment, a 67076, 67102, 44181, 67084FL, or 67084alt protein or 
67076, 67102, 44181, 67084FL, or 67084alt extramembrane domain is characterized by at 
least one "P-type ATPase sequence 2 motif 'in the protein or corresponding nucleic acid 
sequence. As used herein, a "P-type ATPase sequence 2 motif is a conserved sequence 
motif diagnostic for P-type ATPases (Tang, X. et al. (1996) Science 272:1495-1497- Fagan 
M. J. and Saier, M. H. (1994) J. Mol Evol. 38:57). Preferably, a P-type ATPase sequence 2 
motif overlaps with and/or includes an E1-E2 ATPases phosphorylation site (as defined 
herein). The consensus sequence for a P-type ATPase sequence 2 motif is [LIV]-[CAML]- 
[STFL]-D-K-T-G-T-[LI]-T (SEQ ID NO:38). The use of amino acids in brackets indicates 
that the amino acid at the indicated position may be any one of the amino acids within the 
brackets, e.g., [LI] indicates any of one of either L (leucine) or I (isoleucine). In a preferred 
embodiment, a P-type ATPase sequence 2 motif is contained within a C-terminal large 
extramembrane domain. In another preferred embodiment, a P-type ATPase sequence 2 
motif in the 67076, 67102, 44181, 67084FL, or 67084alt proteins of the present invention 
has at least 1 , 2, 3, 4, 5, 6, 7, 8, or more preferably 9 amino acid resides which match the 
consensus sequence for a P-type ATPase sequence 2 motif. A P-type ATPase sequence 2 
motif was identified in the amino acid sequence of human 67076 at about residues 406-415 
ofSEQ ID NO:14. A P-type ATPase sequenced motif was identified in the amino acid 
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sequence of human 671Q2 at about residues 435-444 of SEQ ID NO:17. A P-type ATPase 
sequence 2 motif was identified"™ the amino acid sequence of human 44.181 at about 
residues 404-41 3 of SEQ ID NO:20. A P-type ATPase sequence 2 motif was identified in 
the amino acid sequence of human 67084FX at about residues 413-422 of SEQ ID NO:23. 
5 A P-type ATPase sequence 2 motif was identified in the amino acid sequence of human 
67084alt at about residues 413-422 of SEQ ID NO:26. 

In yet another embodiment, a 67076, 67102, 4418*1, 67084FL, or 67084alt protein or 
67076, 67102, 44181, 67084FL, or 6708'4alt extramembrane domain" is characterized by'.at 
least one "P-type ATPase sequence 3 motif in the protein or corresponding nucleic acid 
10 sequence. As used herein, a "P-type ATPase sequence 3 motif is a conserved sequence 
motif diagnostic forP-type ATPases (Tang, X. et al. (1996) Science 272:1495-1497; Fagan, 
M. J. and'Saier, M H. (1994) J. Mol. Evol, 38:57). Amino acid residues of the P-type, 
ATPase sequence 3 motif are involved in ATP binding. The consensus sequence for a P- 
type ATPase sequence 3 motif is [TIV]-G-D-G-X-N-D-[ASG]-P-[ASV>L (SEQ ID 
15 NO:39). X indicates that the amino acid at the indicated position may be any amino acid 
(i.e., is not conserved). The use of amino acids in brackets indicates that the' amino acid at 
the indicated position may be any one of, the amino acids within the brackets, e.g., [TIV] 
indicates any of one of either T (threonine), I (isoleucine), or V (valine). In a preferred 
embodiment, a P-type ATPase sequenced motif is contained within a C-terminal large 
20 extramembrane domain. In another preferred embodiment, a P-type ATPase sequence 3 
motif in the 67076, 67102, 44181, 67084FL,*or 67084alt proteins of the present invention 
has at least 1, 2, 3, 4, 5, 6, or more preferably 7 amino acid resides (including the amino acid 
at the position indicated by "X") which match the consensus sequence for a P-type ATPase 
sequence 3 motif. A P-type ATPase sequence 3 motif was identified in the amino acid 
25 sequence of human 67076 at about residues 813-824 of SEQ ID NO:14. A P-type ATPase 
sequence 3 motif was identified in the amino acid sequence of human 67 1 02 at about 
residues 1054-1064 of SEQ ID NO:17. A P-type ATPase sequence 3 motif was identified in 
the amino acid sequence of human 44181 at about residues 819-829 of SEQ ID NO:20. A 
P-type ATPase sequence 3 motif was identified in the amino acid sequence of human 
30 67084FL at about residues 820-830 of SEQ ID NO:23. A P-type ATPase sequence 3 motif 
was identified in the amino acid sequence of human 67084alt at about residues 820-830 of 
SEQ ID NO:26. 

In another embodiment, a 67076, 67102, 44181, 67084FL, or 67084alt protein of the 
present invention is identified based on the presence of an "E1-E2 ATPases phosphorylation 
35 site" (alternatively referred to simply as a "phosphorylation site") in the protein or 

corresponding nucleic acid molecule. An E1-E2 ATPases phosphorylation site functions in 
accepting a phosphate moiety and has the amino acid sequence DKTGT (amino acid 
residues 4-8 of SEQ ID NO:38), and can be included within the E1-E2 ATPase 
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phosphorylation site consensus sequence: D-K-T-G-T-[LIVM]-[TI] (SEQ ID NO:41), 
•"• wherein D is phosphorylated. The use of amino acids in brackets indicates that the amino . 
acid at the indicated position may be any one of the amino acids within the brackets, e.g. ; 
[TI] indicates any of one of either T (threonine) or I (ispleucine). The E1-E2 ATPases 
5 phosphorylation site consensus sequence has been assigned ProSite Accession Number 
PS00 1 54. To identify the presence of an E 1 -E2 ATPases phosphorylation site consensus 
sequence in a 67076, 67 1 02, 44 1 8 1 , 67084FL; or 67084alt protein, and to make the 
determination that a protein of interest has a particular profile, the amino acid sequence of 
the protein may be searched against a database of known protein motifs (e.g. , the ProSite 
1 0 database) using the default parameters (available at the Prosite website). A search was 
> performed against the ProSite database resulting in the identification of an E 1 -E2 ATPases 
phosphorylation site consensus sequence in the amino acid sequence of human 67076 (SEQ 

ID NO: 14) at about residues 409-415. A search was performed against the ProSite database 
^ resulting in the identification of an E 1 -E2 ATPases phosphorylation site consensus sequence 

1 5 in the amino acid sequence of human 67 1 02 (SEQ ID NO: 1 7) at about residues 43 8-444. . A 
search was performed against the ProSite database resulting in the identification of an El -E2 
ATPases phosphorylation site consensus sequence in the amino acid sequence of human 
44 1 8 1 (SEQ ID NO:20) at about residues 407-4 1 3. A search was performed against the 
ProSite database resulting in the identification of an E l -E2 ATPases phosphorylation site 

20 consensus sequence in the amino acid sequence of human 67084FL (SEQ ID NO:23) at 
about residues 4 1 6-422. A search was performed against the ProSite database resulting in 
the identification of an El -E2 ATPases phosphorylation site consensus sequence in the 
amino acid sequence of human 67084alt (SEQ ID NO:26) at about residues 4 1 6-422. 
Preferably an E 1 -E2 ATPases phosphorylation site has a "phosphorylation site 

25 activity," for example, the ability to be phosphorylated; to be dephosphorylated; to regulate 
the E1-E2 conformational change of the phospholipid transporter in which it is contained; to 
• regulate transport of phospholipids (e.g., aminophospholipids such as phosphatidylserine 
and phosphatidylethanolamine, choline phospholipids such as phosphatidylcholine and 
sphingomyelin, and bile acids) across a cellular membrane by the 67076, ,67 1 02, 441 8 1 , 

30 67084FL, or 67084alt protein in which it is contained; and/or to regulate the activity (as 
defined herein) of the 67076, 67102, 44181, 67084FL, or 67084alt protein in which it is 
contained. Accordingly, identifying the presence of an "El -E2 ATPases phosphorylation 
site'? can include isolating a fragment of a 67076, 671 02, 441 81 , 67084FL, or 67084alt 
molecule (e.g., a 67076, 671 02, 441 81, 67084FL, or 67084alt polypeptide) and assaying for 

35 the ability of the fragment to exhibit one of the aforementioned phosphorylation site 
activities. 

In another embodiment, a 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt protein of the 
present invention may also be identified based on its ability to adopt an El conformation or 
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an E2 conformation. As used herein, an "El conformation" of a 67076, 67102, 44181 , 
67084FL, or 67084alt protein includes a 3 -dimensional conformation of a'67076, 67102, 
44181, 67084FL, or 67084alt protein which does not.exhibit 67076, 67}Q2, 44181, 
67084FL, qr 67084alt activity (e.g., the ability to transport phospholipids)^ as defined 

5 herein. An El conformation of a 67076, 67102, 44181, 67084FL, or 67084alt protein 
usually occurs when the 67076, 671 02, 44 1 8 1 , 67084FL, or 67084alt protein is 
unphosphorylated. As used herein, an "E2 conformation" of a 67076, 67102, 44181, 
67084FL, or 67084alt protein jncludes a 3-dimensional conformation of a 67076, 67102, 
44181, 67084FL,, or 67084alt protein which exhibits 67076, 67102, 44181, 67084FL, or 
.10 67084alt activity (e.g.,- the ability to transport phospholipids), as defined herein. An E2 

conformation of a 67676, 67 1 02,' 44 1 8 1 , 67084FL, or 67084alt protein usually occurs when 
the 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt protein is phosphorylated. 

In still another embodiment, a 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt protein 
of the present invention is identified based on the presence of "phospholipid transporter 
"~ 15 specific" amino acid residues. As used herein, "phospholipid transporter specific" amino 
acid residues are amino acid residues specific to the class of phospholipid transporting P- 
type ATPases (as defined in Tang, X. et al. (1996) Science 272;1495-1497). Phospholipid 
transporter specific amino acid residues are not found in those P-type ATPases which 
transport molecules which are not phospholipids (e.g., cations). For example, phospholipid 

20 transporter specific amino acid residues are found at the first, second, and fifth positions of 
the P-type ATPase sequence 1 motif. In phospholipid transporting P-type ATPases, the first 
position of the P-type ATPase sequence 1 motif is preferably E (glutamic acid), the second 
position is preferably T (threonine), and the fifth position is preferably L (leucine). A 
phospholipid transporter specific amino acid residue is further found at the second position 

25 of the P-type ATPase sequence 2 motif. In phospholipid transporting P-type ATPases, the 
second position of the P-type ATPase sequence 2 motif is preferably F (phenylalanine). 
Phospholipid transporter specific amino acid residues are still further found at the first, 
tenth, and eleventh positions of the P-type ATPase sequence 3 motif. In phospholipid 
transporting P-type ATPases, the first position of the P-type ATPase sequence 3 motif is 

30 preferably I (isoleucine), the tenth position is preferably M (methionine), and the eleventh 
position is preferably I (isoleucine). 

Phospholipid transporter specific amino acid residues were identified in the amino 
acid sequence of human 67076 (SEQ ID NO: 14) at about residues 174 and 177 (within the 
P-type ATPase sequence 1 motif), at about residue 407 (within the P-type ATPase sequence 

35 2 motif), and at about residues 8 1 3, 823, and 824 (within the P-type ATPase sequence 3 
motif). 

Phospholipid transporter specific amino acid residues were identified in the amino 
acid sequence of human *67 1 02 (SEQ ID NO: 1 7) at about residues 208, 209, and 2 1 2 (within 
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the P-type ATPase sequence 1 motif), at about residue 436 (within tlie P-type ATPase 
sequence 2 motif), and'at about residues 1 054, 1063, and 1 064 (within the P-type ATPase 
sequence 3 motif). . t , r • 

Phospholipid transporter specific ahhino acid residues were identified in the amino 
5 acid sequence of human 44181 (SEQ ID NO:20) at about residues 174 and 177 (within the 
P-type ATPase sequence 1 motif), at about residue 405 (within the P-type ATPase sequence 
2 motif), and at about residues 819, 828, and 829 (within the P-type ATPase sequenced 
motif). i , ' V • '* , 

Phospholipid transporter specific amino acid residues were identified in the amino 
10 acid sequence of human 67084FL (SEQ ID NO:23) at about residues 2 1 4 and 2 1 7 (within 
s the P-type ATPase sequence 1 motif) and at about residues 820, 829, and 830 (within the P- 
type ATPase sequence 3 motif). 

Phospholipid transporter specific amino acid residues were identified in the amino 
acid sequence of human 67084alt (SEQ ID jvfO:26) at about residues 214 and 217 (within 
15 the P-type ATPiase sequence 1 motif), and at about residues 820, 829, and 830 (within the P- 
type ATPase sequence 3 motif). ' 

Isolated polypeptides of the present invention, preferably 67076, 67 1 02, 44 1 8 1 , 
67084FL, or 67084alt polypeptides, have an amino acid sequence sufficiently identical to 
the amino acid sequence of SEQ ID NO;14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID 
20 NO:23, or SEQ ID NO:26 or are encoded by a nucleotide sequence sufficiently identical to 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, or SEQ ID NO:27. For example, 
amino acid or nucleotide sequences which share common structural domains having at least 
50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%,. 85%, 90%, 95%, 96%, 97%, 98%, 99% or 
25 more homology or identity across the amino acid sequences' of the domains and contain at 
least one and preferably two structural domains or motifs, are defined herein as sufficiently 
identical. Furthermore, amino acid or nucleotide sequences which share at least 50%, 55%, 
60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more 
~ : homology or identity and share a common functional activity are defined herein as 
30 sufficiently identical. 

In a preferred embodiment, a 67076, 671 02, 44 181, 67084FL, or 67084alt protein 
includes at least one or more of the following domains, sites, or motifs: a transmembrane 
domain, an N-terminal large extramembrane domain, a C-terminal large extramembrane 
domain, an E1-E2 ATPases phosphorylation site, a P-type ATPase sequence 1 motif, a P- 
35 type ATPase sequence 2 motif, a P-type ATPase sequence 3 motif, and/or one or more 
phospholipid transporter specific amino acid resides, and has an amino acid sequence at 
least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 85%, 90%, 95%, 96%, 97%, 98%, 
99% or more homologous or identical to the amino acid sequence of SEQ ID NO: 14, SEQ 
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ID NO: 1 7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, or the amino acid sequence 
encoded by the DNA insert of the plasmid deposited with ATCC as Accession Number 

. , p or ; •; , , In yet another preferred embodiment, a 

67076, 67102, 44181, 67084FL, or 67084alt protein includes at least one or more of the 
5 following domains, sites, or motifs: a transmembrane .domain, an N-terminal large 
extramembrane domain, a C-terminal large extramembrane domain, an E1-E2 ATPases 
phosphorylation site, a P-type ATPase sequence 1 motif, a P-type ATPase sequenced motif, 
a P-type ATPase sequence 3 motif, and/or one or, more phospholipid transporter specific 
amino acid resides, and is encoded by a nucleic acid molecule having a nucleotide sequence 

10 which hybridizes under stringent hybridization conditions to a complement of a nucleic acid 
, molecule comprising the nucleotide sequence of SEQ ID NO:13, SEQ ID NO:15, SEQ IDt 
NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, 
SEQ ID NO:25,Nor SEQ ID NO:27. In another preferred embodiment, a 67076, 67102, 
1 ' 44 1 8 1 , 67084FL, or 67084alt protein includes at least one or more of the following 

15 domains, sites, or motifs: a transmembrane domain, an N-terminal large extramembrane 
domain, a C-terminal large extramembrane domain, an E1-E2 ATPases phosphorylation 
site, a P-type ATPase sequence 1 motif, a P-type ATPase sequence 2 motif, a P-type 
ATPase sequence 3 motif, and/or one or more phospholipid transporter specific amino acid 
resides, and has a 67076, 67102, 44181, 67084FL, or 67084alt activity. 

20 As used interchangeably herein, a "phospholipid transporter activity" or a "67076, 

67 1 02, 44 18 1 , 67084FL, or 67084alt activity" includes an activity exerted or mediated by a 
67076, 67102, 44181, 67084FL, or 67084alt protein, polypeptide or nucleic acid molecule 
on a 67076, 67102, 44181, 67084FL, or 67084alt responsive cell or on a 67076, 67102, 
44181, 67084FL, or 67084alt substrate, as determined in vivo ox in vitro, according to 

25 standard techniques. In one embodiment, a phospholipid transporter activity is a direct 
activity, such as an association with a 67076, 67102, 44181, 67084FL, or 67084alt target 
molecule. As used herein, a "target molecule" or "binding partner" is a molecule with 
which a 67076, 67102, 44181, 67084FL, or 67084alt protein binds or interacts in nature, 
such that 67076, 67102, 44181, 67084FL, or 67084alt-mediated function is achieved. In an 

30 exemplary embodiment, a 67076, 67102, 44181, 67084FL, or 67084alt target molecule is a 
67076, 67102, 44181, 67084FL, or 67084alt substrate (e.g., a phospholipid, ATP, or a non- 
' 67076, 671 02, 44 1 8 1 , 67084FL, or 67084alt protein). A phospholipid transporter activity 
can also be an indirect activity, such as a cellular signaling activity mediated by interaction 
of the 67076, 67102, 44181, 67084FL, or 67084alt protein with a 67076, 67102, 44181, 

35 67084FL, or 67084alt substrate. 

In a preferred embodiment, a phospholipid transporter activity is at least one of the 
following activities: (i) interaction with a 67076, 67102, 44181, 67084FL, or 67084alt 
substrate or target molecule (e.g., a phospholipid, ATP, or a non-67076, 67102, 441 81, 
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67084FL, or 67084alt protein); (ii) transport of a 67076, 671 02, 441 81, 67084FL, or 
67084alt substrate or target molecule {e.g., an aminophospholipid such as 
phosphatidylserine or phosphatidylethanolamine) from one side of a Cellular membrane to 
the other; (iii) the ability to be phosphorylated or dephosphorylatecl; (iv) adoption of an El 
5 conformation or an E2 conformation; (v) conversion of a 67076, 67 1 02, 44 1 8 1 , 67084FL, or 
67084alt substrate or target molecule to a product (e.g., hydrolysis of ATP); (vi) interaction 
with a second non-67076, 67102, 44181; 67084FL, or 67084alt protein; (vii) modulation of 
substrate or target molecule location (e.g., modulation of phbspholipid location within a cell 
and/or location wim respect to a cellular membrane); (viii) maintenance of . 

1 0 aminophospholipid gradients; '(ix) modulation of intra- or intercellular signaling and/or gene 
• transcription (e.g., either directly or indirectly); and/or (x) modulation of cellular ' 
proliferation, growth, differentiation, apoptosis, absorption, or secretion. 

The nucleotide sequence of the isolated human 67076, 67 1 02, 44 1 8 1 , 67084FL, or 
67084alt cDNA and the predicted amino acid sequence of the human 67076, 67102, 44181, 

1 5 67084FL, or 67084alt polypeptides are shown in Figures 20A : E, 24A-E, 28A-E, 32A-E, and 
36A-E, and in SEQ ID NOs:13 and 14, SEQ ID NOs:16 and 17, SEQ ID NOs:19 and 20, 
SEQ ID NOs:22 and 23, and SEQ ID NOs:25 and 26, respectively. Plasmids containing the 
nucleotide sequence encoding human 67076, human 67102, human 44181, human 67084FL, 
and/or human 67084alt were deposited with the American Type Culture Collection (ATCC), 

20 10801 University Boulevard, Manassas, VA 201 1 0-2209, on . . .. 

• and - respectively, and assigned Accession Numbers , ■ 



. and respectively. These deposits will be maintained under the 



terms of the Budapest Treaty on the International Recognition of the Deposit of 
Microorganisms for the Purposes of Patent Procedure. These deposit were made merely as a 

25 convenience for those of skill in the art and are not admissions that a deposit is required 
under 35 U.S.C. §112 

The human 67076 gene, which is approximately 65 82 nucleotides in length, encodes 
a polypeptide which is approximately 1 1 29 amino acid residues in length. The human 
67 1 02 gene, which is approximately 6074 nucleotides in length, encodes a polypeptide 

30 which is approximately 1426 amino acid residues in length. The human 44181 gene, which 
is approximately 7221 nucleotides in length, encodes a polypeptide which is approximately 
1 177 amino acid residues in length. The human 67084FL gene, which is approximately 
4 1 98 nucleotides in length, encodes a polypeptide which is approximately 1 084 amino acid 
residues in length. The human 67084alt gene, which is approximately 423 1 nucleotides in 

35 length, encodes a polypeptide which is approximately 1 095 amino acid residues in length. 



32 



BNSDOCID: <WO_02055701A2_I_> 



WO 02/055701 PCTAJS01/49060 

Various aspects of the invention are. described in further detail in the following 
subsections: ■ ■ ■ 

L Isolated Nucleic Acid Molecules 

5 One aspect of the invention pertains to isolated nucleic acid molecules that encode 

8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides or 
biologically active portions thereof, as well as nucleic acid fragments sufficient for use as 
hybridization probes to identify 8099, 46455, 54414; 53763, 67076, 671 02, 44181, ' ' , 
67084FL, or 67084alt-encodirig nucleic acid molecules (e.g., 8099, 46455, 54414, 53763, 

10 67076, 671D2, 44181, 67084FL, or 67084alt mRNA) and fragments for use as PCR primers 
for the amplification or mutation of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt nucleic acid molecules. As used herein, the term "nucleic acid 
molecule" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA 
molecules (e.g. , mRNA) and analogs of the DNA or RNA generated using nucleotide 

15 analogs. The nucleic acid molepule can be single-stranded or double-stranded, but 
preferably is double-stranded DNA. • 1 i 

The term "isolated nucleic acid molecule" includes nucleic acid molecules which are 
separated from other nucleic acid molecules which are present in the natural source of the 
nucleic acid. For example, with regards to genomic DNA, the term "isolated" includes 

20 nucleic acid molecules which are separated from the chromosome with which the genomic 
DNA is naturally associated. Preferably, an '"isolated" nucleic acid is free of sequences 
which naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the 
nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. 
For example, in various embodiments, the isolated 8099, 46455, 54414, 53763, 67076, 

25 67 1 02, 44 1 8 1 , 67084FL, or 67084alt nucleic acid molecule' can contain less than about 5 kb, 
4kb, 3kb, 2kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which naturally flank the 
nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. 
Moreover, an "isolated" nucleic acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture medium when produced by 

30 recombinant techniques, or substantially free of chemical precursors or other chemicals 
when chemically synthesized. 

A nucleic acid molecule of the present invention, e.g. , a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO ;4, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ 

35 IDNO:15,SEQIDNO:16,SEQIDNO:18,SEQIDNO:19,SEQIDNO:21,SEQID 

NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the 

DNA insert of the plasmid deposited with ATCC as Accession Number : , 

, , or or a portion thereof, can be isolated using standard 
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molecular biology techniques and the sequence information provided herein. Using all or a 
portion of the nucleic acid sequence of SEQ'ID NO:l, SEQ ID Nb:3, SEQ ID NO-4 SEO 

qfo rrf' ID NO: 7 ' ID NO:9 4 SEQ ID NO; 1 0, SEQ ID NO: 1 2,!sEQ^D NO ^ 
S EQlDNO:15,SE Q mNO^ 

NO:22,SEQIDNO:24, SEQIDNO-25 SEO ID NO-97 th v nu.21, SEQ ID 
r»MA • ^ r u , • Q IDNO.27, or the nucleotide sequence of the 

insert of ^ P Iasmid deposited with ATCC as Accession Number , 

— " ■ — ' or — , » as a hybridization probe; 8099, 46455 5441 4 • si7« 

^dardhybndtzatton and cloning techniques (e.g., as described in Sambreok, J FritshE 

j???* T ' T C " hr C !°'"" g: A Oratory Manual 2m, U, Coding ' 
Harbor Laboratory, Cold Spring Harbor laboratory Press, Cold Spring Harbor, NY 1989) 

JtT™' " nUCkiC m ° leCUle ^1 or a portion of SEQ ID NO i ' 

SEQIDNO:3,SEQIDNO:4, SEQIDNO.-6, SEQIDNO:7, SEQJDN09 SEQ ID 

NO: 1 0,SEQIDNO: I 2, S EQIDNO:13, S EQIDNO :15 ,SEQIDNO:16s'EOmNO 18 

NO:27, or the nucleotide sequence of the DNA insert of the plaWd deposited with ATGC 
as Accession Number ' 

f , , _ . — i- — 5 — ,or_ can be isolated by 

the polymerase cham reaction (PGR) using synthetic oligonucleotide primers designed based 
upon the seque^e of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO-6 SEQ ID 
NO:7, SEQ ID NO* SEQ ID NO: 10, SEQ IDNQ: 12 , SEQ ,D NO.B.^EQ S 
SEQ ID NO: 16, SEQ ID NQ:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO-22 SEQ ID 
NO:24 SEQ ID NO:25, SEQ IDNO:27, or the nucleotide sequence of the DNA to^of toe 
plasmid deposited with ATCC as Accession Number 

or ' ' ' 

A nucleic acid of the invention can be amplified using cDNA mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 
according to standard PCR amplification techniques. The nucleic acid so amplified can be 
cloned into an appropriate vector and characterized by DNA sequence analysis 

L U ^T^^ 8 ° nUCle0tideS COITCS P° ndin 8 to 8099 > 4 6455, 54414, 53763, 67076, 67102, 
44 8 1 , 67084FL, or 67084ait nucleotide sequences can be prepared by standard synthetic 
techniques, e.g. , using an automated DNA synthesizer. 

to one embodiment, an isolated nucleic acid molecule of the invention comprises the ' 
nucleohdesequenceshownfaSEQIDNO:!. The sequence of SEQ ID NO:l corresponds 
to the human 8099 cDNA. This cDNA comprises sequences encoding the human 8099 
polypepude (i.e., "the coding region", from nucleotides 180-2034) as well as 5' untranslated 
sequences (nucleotides 1-179) and 3' untranslated sequences (nucleotides 2035-2725) 
Alternatively, toe nucleic acid molecule can comprise only the coding region of SEQ ID 
NO: 1 (e.g., nucleotides 1 80-2034, corresponding to SEQ ID NO:3). Accordingly, in another 
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embodiment, the isolated nucleic acid molecule comprises SEQ ID NO:3 and nucleotides 1- 
179 and 2035-2725 of SEQ ID NO:l. In yet another embodiment, the nucleic acid molecule 
consists of the nucleotide sequence set forth as SEQ IDNO^l or SEQ ID NO:3. 

■ In another embodiment, an isolated nucleic acid molecuie of the invention comprises 
5 the nucleotide sequence shown in SEQ ID NO:4. The sequence of SEQ ID N0.4 

corresponds to the human 46455 cDNA. This cDNA comprises sequences encoding the 
human 46455 polypeptide (U, "the coding region", from nucleotides 376-1963) as well as 
5' untranslated sequences '(nucleotides 1-375) and 3' untranslated sequences (nucleotides 
1964-2230). Alternatively, the nucleic acid molecule can comprise only the coding region 
10 of SEQ ID NO:4 (e.g., nucleotides 376-1963, corresponding to SEQ ID NO:6). 

, Accordingly, in another embodiment, the isolated nucleic acid molecule comprises SEQ ID 
NO:6 and nucleotides 1-375 and 1964-2230 of SEQ ID NO:4. In yet another embodiment, 
the nucleic acid molecule consists of the nucleotide sequence set forth as SEQ IDNO:4 or 
' SEQIDNO:6. 

15 i n another embodiment, an isolated nucleic acid molecule of the invention comprises 

the nucleotide sequence shown in SEQ ID NO:7. This cDNA may comprise sequences 
encoding the human 54414 protein (e.g., the "coding region", from nucleotides 225-3578), 
as well as 5' untranslated sequence (nucleotides 1-224) and 3' untranslated sequences 
(nucleotides 3579-4632) of SEQ ID NO:7. Alternatively, the nucleic acid molecule can 

20 comprise only the coding region of SEQ ID NO:7 (e.g., nucleotides 225-3578, 

corresponding to SEQ ID NO:9). Accordingly, in another embodiment, an isolated nucleic 
acid molecule of the invention comprises SEQ ID NO:9 and nucleotides 1-224 of SEQ ID 
NO:7. In yet another embodiment, the isolated nucleic acid molecule comprises SEQ ID 
NO:9 and nucleotides 3579-4632 of SEQ ID NO:7. In yet another embodiment, the nucleic 

25 acid molecule consists of the nucleotide sequence set forth as SEQ ID NO:7 or SEQ ID 
NO:9. 

In still another embodiment, the cDNA may comprise sequences encoding the 
human 53763 protein (e.g., the "coding region", from nucleotides 561-2474), as well as 5' 
untranslated sequence (nucleotides 1-560) and 3 ' untranslated sequences (nucleotides 2475- 
30 2847) of SEQ ID NO: 10. Alternatively, the nucleic acid molecule can comprise only the 
coding region of SEQ ID NO:10 (e.g., nucleotides 561-2474, corresponding to SEQ ID 
' NO:6). Accordingly, in another embodiment, an isolated nucleic acid molecule of the 
invention comprises SEQ ID NO: 12 and nucleotides 1-560 of SEQ ID NO:10. In yet 
another embodiment, the isolated nucleic acid molecule comprises SEQ ID NO: 12 and 
35 nucleotides 2475-2847 of SEQ ID NO:10. In yet another embodiment, the nucleic acid 

molecule consists of the nucleotide sequence set forth as SEQ ID NO:l0 or SEQ IDNO:12. 

In yet another embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO:13. The sequence of SEQ ID 
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NO: 13 corresponds to the human 67076 cDNA. This cDNA comprises sequences encoding 
the human 67076 polypeptide (z.e., "the coding region", from nucleotides 524-3910) as well 
as 5' untranslated sequences (nucleotides 1-523) and 3* untranslated sequences (nucleotides 
391 1-6582)* Alternatively, the nucleic aciJi' molecule can comprise only the coding region 
of SEQ ID NO:13 (e.g., nucleotides 524-3910, corresponding to SEQ ID NO:15). 
Accordingly, in another embodiment, the isolated nucleic acid molecule comprises SEQ ID 
NO:15 and nucleotides 1-523 or 391 1-6582 of SEQ ID WO:13. In yet another Embodiment, 
the nucleic acid molecule consists of the nucleotide sequence set forth as SEQ ID N6:13 or 
SEQ ID NO: 15. . 

In artother embodiment, an isolated nucleic acid molecule of the invention comprises 
the nucleotide sequence shown in SEQ ID NO: 1 6. The sequence of SEQ ID NO: 1 6 
corresporlds to the human 67102 cDNA. This cDNA comprises sequences encoding the 
human 67102 polypeptide (i.e., "the coding region", from nucleotides 274-4551) as well as 
5 * untranslated sequences (nucleotides 1-273) and 3 * untranslated sequences (nucleotides 
4552-6074). Alternatively, the nucleic acid molecule can comprise only the coding region 
of SEQ ID NO:16 (e.g., nucleotides 274-4551; corresponding to SEQ ID NO: 18). 
Accordingly, in another embodiment, the isolated nucleic acid molecule comprises SEQ ID 
NO: 1 8 and nucleotides 1 - 273 or 4552-6074 of SEQ ID NO: 16. In yet another embodiment, 
the nucleic acid molecule consists of the nucleotide sequence set forth as SEQ ID NO: 16 or 
SEQ ID NO: 18. 

In still another embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO: 19. The sequence of SEQ ID 
NO: 19 corresponds to the human 44181 cDNA. This cDNA comprises sequences encoding 
the human 44181 polypeptide (i.e., "the coding region", from nucleotides 167-3697) as well 
as 5' untranslated sequences (nucleotides 1-166) and 3* untranslated sequences (nucleotides 
3698-7221). Alternatively, the nucleic acid molecule can comprise only the coding region 
of SEQ IDNO:19 (e.g., nucleotides 167-3697, corresponding to SEQ IDNO:21). 
Accordingly, in another embodiment, the isolated nucleic acid molecule comprises SEQ ID 
NO:21 and nucleotides 1-166 or 3698-7221 of SEQ ID NO:19. In yet another embodiment, 
the nucleic acid molecule consists of the nucleotide sequence set forth as SEQ ID NO: 19 or 
SEQ ID NO:21. 

In yet another embodiment, an isolated nucleic acid molecule of the invention 
comprises the nucleotide sequence shown in SEQ ID NO:22. The sequence of SEQ ID 
NO:22 corresponds to the human 67084FL cDNA. This cDNA comprises sequences 
encoding the human 67084FL polypeptide (i.e., "the coding region", from nucleotides 156- 
3407) as well as 5' untranslated sequences (nucleotides 1-155) and 3' untranslated 
sequences (nucleotides 3408-4198). Alternatively, the nucleic acid molecule can comprise 
only the coding region of SEQ ID NO:22 (e.g., nucleotides 156-3407, corresponding to SEQ 
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ID NO:24). Accordingly, in another embodiment, the isolated nucleic acid molecule 

■"■ comprises SEQ ID NO:24 and nucleotides 1-155 or 3408-4198 of SEQ ID NO:22. In yet , 
another embodiment, the nucleic acid molecule consists of the nucleotide sequence set forth 
asSEQIDNO:22orSEQIDNO:24. 
5 In a further embodiment, an isolated nucleic acid molecule of the invention 

comprises the nucleotide sequence shown in SEQ ID NO:25. The sequence of SEQ ID 
NO:25 corresponds to the human 67084alt cDNA. This cDNA comprises sequences 
encoding the human 67084alt polypeptide (/.£., "the coding fegion", from nucleotides 156- 
3440) as well as 5' untranslated sequences (nucleotides 1-155) and 3 5 untranslated 

10 sequences (nucleotides 3441-4231). Alternatively, the nudeic acid molecule can comprise 
i only the coding region of SEQ ID NO:25 (e.g., nucleotides 156-3440, corresponding to SEQ 
ID NO:27). Accordingly, in another embodiment, the isolated nucleic acid molecule 
comprises SEQ ID NO:27 and nucleotides 1-155 or 3441-4231 of SEQ ID NO:25. In yet 
another embodiment, the nucleic acid molecule consists of the nucleotide sequence set forth 
^ 15 as SEQ ID NO:25 or SEQ ID NO:27. 

In still another embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence shown 
in SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID NO.10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, 

20 SEQIDNO:18, SEQ ID NO: 19, SEQ ID NO :21, SEQ ID NO :22, SEQ ID NO :24, SEQ ID 
NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA insert of the plasmid 

deposited with ATCC as Accession Number , , , or 

, or a portion of any of these nucleotide sequences. A nucleic acid molecule which 

is complementary to the nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:3, SEQ 

25 lb NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO: 18, SEQ ID NO:19, SEQ ID 
. NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide 
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 
, , , , of . is one which is sufficiently 

30 complementary to the nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ 
ID NO:13, SEQ ID NO:15, SEQ ID NO:l6, SEQ ID NO: 18, -SEQ ID NO:19, SEQ ID 
NO;2 1 , SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide 
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 

35 , ^ . , - • or , such that it can hybridize to the 

nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, 
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I - ■ 

SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA 

insert of the plasmid deposited with ATCC as Accession Number ,_ ■ 

— — _> _> °T , thereby forming' a stable duplex. 

'In still another preferred embodiment, an isolated nucleic acid molecule of the 
5 present invention comprises a nucleotide sequence which is at least about 50%, 55%, 60%, 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more identical to the 
nucleotide sequence shown in SEQ ID NO.l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:7, SEQ IDNO:9, SEQ IDNO:10, SEQ ID NO:12i SEQ IDNO:13, SEQ ID ' 
NO:15, SEQ ID NO:16, SEQ ID NO: 18, SEQ ID NO:19, SEQ IDNO:21, SEQIDNO:22, 
10 SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27 (e.g., to the entire length of the nucleotide 
• sequence), or to the nucleotide sequence (e.g., the entire length of the nucleotide sequence) 
of the DNA insert of the plasmid deposited with ATCC as Accession Number 

-> or or a portion of any of these nucleotide sequences. 



In one embodiment, a nucleic acid molecule of the present invention comprises a nucleotide 
15 sequence which is at least (or no greater than) 50-100, 100-250, 250-500, 500-750, 750- 
1000, 1000-1250, 1250-1500, 1500-1750, 1750-2000, 2000-2250,2250-2500, 2500-2750, 
2750-3000, 3000-3250, 3250-3500, 3500-3750, 3750-4000, 4000-4250, 4250^4500, 4500- 
4750, 4750-5000, 5000-5250, 5250-5500, 5500-5750, 5750-6000, 6000-6250, 6250-6500, 
6500-6750, 6750-7000, 7000-7250, 7250-7500 or more nucleotides in length and hybridizes 
20 under stringent hybridization conditions to a complement of a nucleic acid molecule of SEQ 
ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ 
ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, 
SEQ ID NO:27, or the nucleotide sequence of the DNA insert of the plasmid deposited with 
25 ATCC as Accession Number , nr 

Moreover, the nucleic acid molecule of the invention can comprise only a portion of 
the nucleic acid sequence of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, 

SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ Id' 
NO:15, SEQ ID NO:16, SEQ ID NO:18; SEQ ID NO:19, SEQ ID NO:21„ SEQ ID NO:22, 
30 SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA ' 

insert of the plasmid deposited with ATCC as Accession Number _, , 

or _ > for example, a fragment which can be used as a probe or 



primer or a fragment encoding a portion of a 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt polypeptide, e.g., a biologically active portion of a 8099, 
35 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt polypeptide. The 

nucleotide sequence determined from the cloning of the 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt gene allows for the generation of probes and primers 
designed for use in identifying and/or cloning other 8099, 46455, 544 1 4, 53763, 67076, 
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67102, 44181, 67084FL, or 67084alt family members, as well as 8099, 46455, 54414, 
53763, 67076, 67 102,, 44 181, 67084FL, or 67084alt homologues from other species. The 
probe/primer typically comprises substantially purified oligonucleotide. The probe/primer 
{e.g., oligonucleotide) typically comprises, a region of nucleotide sequence that hybridizes 

5 under stringent conditions to at least about 12 or 15, preferably about 20 or 25, more 
preferably about 30, 35, 40, 45, 50, 55, 60, 65, 75, 80, 85, 90, 95, or 100 or more 
consecutive nucleotides of a sense sequence of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID ( NO:7, SEQ ID NO:9,.SEQ ID NO:10, SEQ ID NO:12, SEQ 
ID NO:l3, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:l 8, SEQ ID NO:19, SEQ ID 

10 NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide 
sequence of the DNA 'insert of the plasmid deposited with ATCC as Accession Number 

t , . , or ______ of an anti-sense sequence of SEQ ID 

NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO:10, SEQ ID NO:12, SEQ IDNO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, 

1 5 SEQ ID NO: 1 9, SEQ ID NO:2 1, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:27 or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 

Accession Number ■• _ 3 , . or , or of a naturally 

occurring allelic variant or mutant of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ 

20 ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID 

NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the 

DNA insert of the plasmid deposited with ATCC as Accession Number , 

, .or 

Exemplary probes or primers are at least 12, 1 5, 20, 25, 30, 35, 40, 45, 50, 55, 60, 
25 * 65, 70, 75 or more nucleotides in length and/or comprise consecutive nucleotides of an 
isolated nucleic acid molecule described herein. Probes based on the 8099, 46455, 544 14, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleotide sequences can be used to 
detect (e.g., specifically detect) transcripts or genomic sequences encoding the same or 
homologous polypeptides. In preferred embodiments, the probe further comprises a label 

30 group attached thereto, e.g. , the label group can be a radioisotope, a fluorescent compound, 
an enzyme, or an enzyme co-factor. In another embodiment a set of primers is provided, 
e.g., primers suitable for use in a PCR, which can be used to amplify a selected region of a 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt sequence, e.g., a 
domain, region, site or other sequence described herein. The primers should be at least 5, 

35 10, 20, 30, 40, 50, 60, 70, 80, 90, 100 or more nucleotides in length. Such probes can be 
used as a part of a diagnostic test kit for identifying cells or tissue which misexpress a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide, such as by 
measuring a level of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
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67084alt -encoding nucleic acid in a sample of cells from a subject e.g., detecting 8099, 
46455, 54414, 53763,' 67076, 67102, 44181', 67084FL, or 67084alt mRNA levels Or 
determining whether a genomic 8099, 46455, 54414,.53763, 67076, 67102, 44181, 
67084FL, or 67084alt gene has been mutated or deleted. 
5 A nucleic acid fragment encoding a "biologically active portion of a 8099, 46455, 

54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide" can be prepared 
by isolating a portion of the nucleotide sequence of SEQ'ID NO:l, SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID.NO.7, SEQ ID NO:9,SEQ ID NO: 10, SEQ ID NO: 12, SEQ 
ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID 

10 NO:21, SEQ ID NO:22, SEQ IDNO:24,SEQ IDNO:25, SEQ ID NO:27, or the nucleotide 
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 

' ' , . or , which encodes a polypeptide having a 

8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt biological activity 
(the biological activities of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 

15 or 67084alt polypeptides are described herein), expressing the encoded portion of the 8099, 
46455, 54414, 53763, 67076^ 67102, 44181, 67084FL, or 67084alt polypeptide (e.g., by 
recombinant expression in vitro) and assessing the activity of the encoded portion of the 
8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084ah polypeptide. In 
an exemplary embodiment, the nucleic acid molecule is at least 50-100, 100-250, 250-500, 

20 500-750, 750-1000, 1000-1250, 1250-1500, 1500-1750, 1750-2000, 2000-2250, 2250-2500, 
2500-2750, 2750-3000, 3000-3250, 3250-3500, 3500-3750, 3750-4000, 4000-4250, 4250- 
4500, 4500-4750, 4750-5000, 5000-5250, 5250-5500, 5500-5750, 5750-6000, 6000-6250, 
6250-6500, 6500-6750, 6750-7000, 7000-7250, 7250-7500 or more nucleotides in length 
and encodes a polypeptide having a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

25 67084FL, or 67084alt activity (as described herein). 

The invention further encompasses nucleic acid molecules that differ from the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID N0.13, SEQ ID 
NO:15, SEQ IDNO:l6, SEQ IDNO:18; SEQ IDNO:19, SEQ IDNO:21, SEQ ID NO:22, 

30 SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA 
insert of the plasmid deposited with ATCC as Accession Number _ . , ; '.. , 

_, ,or . Such differences can be due to due to degeneracy of the 

genetic code, thus resulting in a nucleic acid which encodes the same 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides as those encoded by the 

35 nucleotide sequence shown in SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO/.6, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 1 0, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:22, 
SEQ ID NO:24, SEQ ID NO-.25, SEQ ID NO:27, or the nucleotide sequence of the DNA 
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insert of the plasmid deposited with ATCC as Accession Number : , 

J or • . In another embodiment, an isolated nucleic acid molecule; 

of the invention has a nucleotide sequence encoding a polypeptide having an amino acid 
sequence which differs by at least 1, but no greater than 5, 10, 20, 50 or 100 amino acid 
residues from the amino acid sequence shown in SEQ ID NO:2, SEQ ID NO:5, SEQ ID 
NO:8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or 
SEQ ID NO:26, or the amino acid sequence encoded by the DNA insert of the plasmid 

deposited with the ATCC as Accession Number j , 1 . _ 9 or 

. In yet another embodiment, the nucleic acid molecule encodes the amino acid 

sequence of human 8099, 46465, 54414, 53763, 67076,; 67102, 44181, 67084FL, or 
67084alt . If an alignment is needed for this comparison, the sequences should be aligned 1 
for maximum homology. 

Niicleic acid variants can be naturally occurring, such as allelic variants (same locus), 
homologues (different locus), and orthologues (different organism) or can be non naturally 
occurring. Non-naturally occurring variants can be made by mutagenesis techniques, 
including those applied to polynucleotides, cells, or organisms. The variants can contain 
nucleotide substitutions, deletions, inversions and insertions. Variation can occur in either or 
both the coding and non-coding regions. The variations can produce both conservative and 
non-conservative amino acid substitutions (as compared in the encoded product). 

Allelic variants result, for example, from DNA sequence polymorphisms within a 
population (e.g., the human population) that lead to changes in the amino acid sequences of 
the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides. 
Such genetic polymorphism in the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt genes may exist among individuals within a population due to natural 
allelic variation. As used herein, the terms "gene" and "recombinant gene" refer to nucleic 
acid molecules which include an open reading frame encoding a 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt polypeptide, preferably a mammalian 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide, and can 
further include non-coding regulatory sequences, and introns. 

Accordingly, in one embodiment, the invention features isolated nucleic acid 
molecules which encode a naturally occurring allelic variant of a polypeptide comprising the 
' amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 11, SEQ 
ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, or an amino 
acid sequence encoded by the DNA insert of the plasmid deposited with ATCC as 

Accession Number , _ ; , or • , wherein the nucleic 

acid molecule hybridizes to a complement of a nucleic acid molecule comprising SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ IDNO:15, SEQ ID NO:16, SEQ ID NO:18, 
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SEQ ID N0:19, SEQ ID N0:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:27, for example, under stringent hybridization I conditions. 

Allelic variants of human 8099, 46455, 544,14, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt include both functional and non-functional 8099, 46455, 54414' 
5 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides. 

Functional allelic variants are naturally occurring amino acid sequence variants of 
the human 8099 or 46455 polypeptides that have an 8099 or 46455 activity, e.g., maintain 
the ability to bind an 8099 or 46455 ligand or substrate and/or modulate sugar transport, or 
sugar homeostasis. 

10 Functional allelic variants are naturally occurring amino acid sequence variants of 

i the human 54414 or 53763 polypeptides that maintain the ability to, e.g., bind or interact • 
with a 54414 or -53763 target molecule and/or modulate membrane excitability. 

Functional allelic variants are naturally occurring amino acid sequence variants of 
the human 67076, 67102, 44181, 67084FL, or 67084alt polypeptides that have a 67076 
67102, 44181, 67084FL, or 67084alt activity, e.g., bind or interact with a 67076, 67102,. 
44181, 67084FL, or 67084alt substrate or target molecule, transport a 67076, 67102, 44181, 
67084FL, or67084alt substrate or target molecule across a cellular membrane, hydrolyze ' 
ATP, be phosphorylated or dephosphorylated, adopt an E 1 conformation or an E2 
conformation, and/or modulate cellular signaling, growth, proliferation, differentiation, 
20 absorption, or secretion. 

Functional allelic variants will typically contain only conservative substitution of 
one or more amino acids of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO- 1 1 

SEQIDNO:14,SEQIDNO:17,SEQIDNO:20,SEQIDNO:23,orSEQIDNO:26,or ' 
substitution, deletion or insertion of non-critical residues in non-critical regions of the 
25 polypeptide. 

Non-functional allelic variants are naturally occurring amino acid sequence variants 
of the human 8099 or 46455 polypeptides that do not have a 8099 or 4645 5 activity, e.g. , 
maintain the ability to bind an 8099 or 46455 ligand or substrate and/or modulate sugar ' 
transport, or sugar homeostasis. 

Non-functional allelic variants are naturally occurring amino acid sequence variants 
of the human 54414 or 53763 polypeptides that do not maintain the ability to, e.g., bind or 
interact with a 54414 or 53763 target molecule and/or modulate membrane excitability. 

, Non-functional allelic variants are naturally occurring amino acid sequence variants 
of the human 67076, 67102, 44181, 67084FL, or 67084alt polypeptides that do not have a 
67076, 67102, 44181, 67084FL, or 67084alt activity, e.g., that do not have the ability to, 
e.g, bind or interact with a 67076, 67102, 44181, 67084FL, or 67084alt substrate or target 
molecule, transport a 67076, 67102, 44181, 67084FL, or 67084alt substrate or target 
molecule across a cellular membrane, hydrolyze ATP, be phosphorylated or 
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dephosphqrylated, adopt an El conformation or an E2 conformation^ and/or modulate 
cellular signaling, growth, proliferation, differentiation, absorption, or secretion. 

Non-functional allelic variants will typically contain a non-conservative substitution, 
a deletion, or insertion or premature truncation of the amino, acid sequence of SEQ ID NO:2, 
5 SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO: 14, SEQ ID NO:17, SEQ ID 
NO:20, SEQ ID NO:23, or SEQ ID NO:26, or a substitution, insertion or deletion in critical 
residues or critical regions. . 1 

The present invention further provides non-human orthologues of the human. 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides. 
10 Orthologues of human 8099 or 46455 polypeptides are polypeptides that are isolated from 
non-human organisms and possess the same 8099 and/or 46455 activity, e.g., ligand binding 
and/or mpdulation of sugar transport mechanisms, as the human 8099 and/or 46455 
polypeptide. Orthologues of the human 54414 or 53763 polypeptides are polypeptides that 
are isolated from non-human organisms and possess the same 54414 or 53763 target 
molecule binding mechanisms and/or ability to modulate membrane excitability of the 
human 54414 or 53763 polypeptides. Orthologues of human 67076, 67102, 44181, 
67084FL, or 67084alt ■ polypeptides are polypeptides that are isolated from non-human 
organisms and possess the same 67076/67102, 44181, 67084FL, or 67084alt substrate or 
target molecule binding mechanisms, phospholipid transporting activity, ATPase activity, 
20 and/or modulation of cellular signaling mechanisms of the human 67076, 67 1 02, 44 1 81 , 
67084FL, or 67084alt proteins as the human 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptides. 

Orthologues of the human 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptides can readily be identified as comprising an amino acid 
25 sequence that is substantially identical to SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ 
ID NO:l 1, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID 
NO:26. 

Moreover, nucleic acid molecules encoding other 8099, 46455, 54414, 53763, 
67076, 67102, 441 8 1, 67084FL, or 67084alf family members and, thus, which have a 

30 nucleotide sequence which differs from the 8099, 46455, 544 1 4, 53763, 67076, 67 1 02, 
44181, 67084FL, or 67084alt sequences of SEQ ID NO:l, SEQ IDNO:3, SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
NO:13, SEQIDNO:15, SEQIDNO:16, SEQIDNO:18, SEQIDNO:19, SEQIDNO:21, 
SEQ ID NO:22, SEQ ID NO:24; SEQ ID NO:25, SEQ ID NO:27, or the nucleotide 

35 sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 

' j j , or . are intended to be within the scope of the 

invention. For example, another 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt cDNA can be identified based on the nucleotide sequence of human 
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8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . Moreover, 
nucleic acid molecules encoding 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptides from different species, andwhich, thus,' have a 
nucleotide sequence which differs from the 8099, 46455, 54414^ 53763, 67076, 67102,' 
44 1 8 1 , 67084FL, or 67084alt sequences of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID NO:4 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO: 12, SEQ ID * 
NO.13, SEQ ID N0.15, SEQ ID NO:16; SEQ ID NO.18, SEQ ID N0.19 SEQ ID N021 
SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25 » SEQ ID NO:27, or the nucleotide 
sequence of the DNA insert of the plasmid deposited with ATCG as Accession Number 

— ' -' » — ^ or are intended to be within the scope of the 

i invention. For example, a mouse 8099, 46455, 54414, 53763, 67076, 67102 44181 • 
67084FL, or 67084alt cDNA can be identified based on the nucleotide sequence of a human 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . 

Nucleic acid molecules corresponding to natural allelic variants and homologues of 
15 the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt cDNAs of the 
invention can be isolated based on their homology to the 8099, 46455, 54414, 53763, 67076 
67102, 44181, 67084FL, or 67084alt nucleic acids disclosed herein using the cDNAs ' 
disclosed herein, or a portion thereof, as a hybridization probe according to standard 
hybridization techniques under stringent hybridization conditions. Nucleic acid molecules 
corresponding to natural allelic variants and homologues of the 8099, 46455, 54414 53763 
67076, 67102, 44181, 67084FL, or 67084alt cDNAs of the invention can further be' isolated 
by mapping to the same chromosome or locus as the 8099, 46455, 54414, 53763, 67076 
67102, 44181, 67084FL, or 67084alt gene. 

Orthologues, homologues and allelic variants can be identified using methods known 
25 in the art (e.g., by hybridization to an isolated nucleic acid molecule of the present 

invention, for example, under stringent hybridization conditions). In one embodiment, an 
isolated nucleic acid molecule of the invention is at least 15, 20, 25, 30 or more nucleotides 
in length and hybridizes under stringent conditions to the nucleic acid molecule comprising 
the nucleotide sequence of SEQ ID NO:i, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID 
NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID N0 22 
SEQ ID N0:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA ' 
insert of the plasmid deposited with ATCC as Accession Number 



20 



30 



35 



-' or • In othe r embodiment, the nucleic acid is at least 50-1 00, 



100-250,250-500,500-750,750-1000, 1000-1250, 1250-1500, 1500-1750, 1750-2000 
2000-2250, 2250-2500, 2500-2750, 2750-3000, 3000-3250, 3250-3500, 3500-3750 3750- 
4000, 4000-4250, 4250-4500, 4500-4750, 4750-5000, 5000-5250, 5250-5500, 5500-5750 
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5750-6000, 6000-6250, 6250-6500, 6500-6750, 6750-7000, 7000-7250, 7250-7500 or more 
nucleotides in length.' * 

As used herein y the term "hybridizes under stringent conditions", is intended to 
describe conditions for hybridization and hashing under which nucleotides that 
5 are significantly identical or homologous to each other remain hybridized to each other. 
Preferably, the conditions are such that sequences at least about 70%, more preferably at 
least about 80%, even more preferably at least about 85% or 90% identical to each other 
remain hybridized to each' other. Such stringent conditions are known to those skilled in the 
art and can be found in Current Protocols in Molecular Biology, Ausubel etal, eds., John 
10 Wiley & Sons, Inc. (1995), sections 2, 4 and 6. Additional stringent conditions can be 
found in Molecular dloning: A Laboratory Manual, Sambrook et al. , Cold Spring Harbor 
Press, Cold Spring Harbor, NY (1 989), chapters 7, 9 and 1 1 . A preferred, non-limiting 
example of stringent hybridization conditions includes hybridization in 4X sodium 
Chloride/sodium citrate (SSC), at about 65-70°C (or hybridization in 4X SSC plus 50% 
15 formamide at about 42-50 o C)fpllowed by one or more washes in IX SSC, at about 65- 
70°C. A preferred, non-limiting example of highly stringent hybridization conditions 
includes hybridization' in IX SSC, at about 65-70°C (or hybridization in IX SSC plus 50% 
formamide at about 42-50°C) followed by one or more washes in 0.3X SSC, at about 65- 
70°C. A preferred, non-limiting example of reduced stringency hybridization conditions 
20 includes hybridization in 4X SSC, at about 50-60°C (or alternatively hybridization in 6X 
SSC plus 50% formamide at about 40-45°C) followed by one or more washes in 2X SSC, at 
about 50-60°C. Ranges intermediate to the above-recited values, e.g. , at 65-70°C or at 42- 
50°C are also intended to be encompassed by the present invention. SSPE (lxSSPE is 
0.15M NaCl, lOmM NaH 2 P0 4 , and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
25 (lxSSC is 0.15MNaCl and 15mM sodium citrate) in the hybridization and wash buffers; 
washes are performed for 1 5 minutes each after hybridization is complete. The 
hybridization temperature for hybrids anticipated to be less than 50 base pairs in length 
should be 5-10°C less than the melting temperature (T m ) of the hybrid, where T m is 
determined according to the following equations. For hybrids less than 18 base pairs in 
30 length, T m (°C) - 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 1 8 and 49 
base pairs in length, T m (°C) ± 81.5 + 16.6(log 10 [Na + ]) + 0.41(%G+C) - (600/N), where N is 
the number of bases in the hybrid, and [Na + ] is the concentration of sodium ions in the 
hybridization buffer ([Na + ] for lxSSC = 0. 165 M). It will also be recognized by the skilled 
practitioner that additional reagents may be added to hybridization and/or wash buffers to 
35 decrease non-specific hybridization of nucleic acid molecules to membranes, for example, 
nitrocellulose or nylon membranes, including but not limited to blocking agents (e.g., BSA 
or salmon or herring sperm carrier DNA), detergents (e.g. , SDS), chelating agents (e.g., 
EDTA), Ficoll, PVP and the like. When using nylon membranes, in particular, an additional 
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preferred, non-limiting example of stringent hybridization conditions is hybridization in 
0.25-0.5MNaH 2 PO 4 , >7% SDS a'tabout 65°C, followed by one or more washes at 0.Q2M 
NaH 2 P0 4 , 1% SDS at 65°C, see e.g., Church and Gilbert (1984) Proc. Natl Acad. Sci. USA 
81:1991-1995, (or alternatively 0.2X SSC,'1% SDS). 
5 Preferably, an isolated nucleic acid molecule of the invention that hybridizes under 

stringent conditions to the sequence of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID N04 SEQ 
ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO.10,' SEQ ID NO:12, SEQ ID NO" 13 
SEQ ID NQ.15, SEQ ID ^0:16, SEQ ID NO: 18, SEQ ID NOM9, SEQ ID N0 21 SEQ ID 
NO:22, SEQIDNO.-24, SEQIDNO.-25, SEQID NO:27, and corresponds to a naturally- 
10 occurring nucleic acid' 

molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to an 
RNA or DNA molecule having a nucleotide sequence that occurs in nature (e.g., encodes a 
natural polypeptide). ' 

In addition to naturally-occurring allelic variants of the 8099, 46455 54414 53763 
67076, 67102, 44181, 67084FL, or 67084alt sequences that may exist in the population, the 
skilled artisan will further appreciate that changes can be introduced by mutation into the 
nucleotide sequences of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO 6 SEQ 
ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO- 15 
SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID N0 22 SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA insert of the 
plasmid deposited with ATCC as Accession'Number ' 



15 



25 



30 



35 



or 



_, thereby leading to changes in the amino acid sequence of the encoded 8099, 



46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides, without 
altering the functional ability of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptides. , For example, nucleotide substitutions leading to amino 
acid substitutions at "non-essential" amino acid residues can be made in the sequence of SEQ 
ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO-9 SEQ 
ID NO:10, SEQ IDNO:12, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ Id' 
N0.18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, 
SEQ ID NO:27, or the nucleotide sequence of the DNA insert of the plasmid deposited with 
ATCC as Accession Number 



.of A "non- 



essential" amino acid residue is a residue that can be altered from the wild-type sequence of 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt (e.g., the sequence 
of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NOH SEQ ID 
NO.I7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26) without altering the 'biological 
activity, whereas an "essential" amino acid residue is required for biological activity. 

For example, amino acid residues that are conserved among the 8099 or 46455 
polypeptides of the present invention, e.g., those present in a transmembrane domain and/or a 
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sugar transporter family domain, are predicted to be particularly unamenable to alteration. 
Furthermore, additional amino acid residues that are conserved between the 8099 or 46455 _ 
polypeptides of the present invention, and other members of the,8099 or 46455 family are not 
likely >to be amenable to alteration. 
5 Amino acid residues that are conserved among the 54414 or 53763 polypeptides of 

the present invention, e.g. , those present in a P-loop or a pore domain, are predicted to be 
particularly unamenable to alteration. Furthermore, additional amino acid residues that are 
conserved between the 54414 or 53763 polypeptides of the present invention and other 
members of the potassium channel family are not likely to be amenable to alteration. 
10 Amino acid residues that are conserved among die 67076, 67102, 44181, 67084FL, or 

, 67084alt polypeptides of the present invention, e.gv, those present in a El-E2ATPases • 
phosphorylation .site, are predicted to be particularly unamenable to alteration. Furthermore, 
additional amino acid residues that are conserved between the 67076, 671Q2, 44181, 
' ' 67084FL, or 67084alt polypeptides of the present invention and other members of the 
15 phospholipid transporter family are not likely to be amenable to alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding 8099, 46455, 5441 4, 53763, 67076, 67102, 441 8 1 , 67084FL, or 67084alt 
polypeptides that contain changes in amino acid residues that are not essential for activity. 
Such 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides 
20 differ in amino acid sequence from SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID 
NO:l 1 , SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, 
yet retain biological activity. In one embodiment, the isolated nucleic acid molecule 
comprises a nucleotide sequence encoding a polypeptide, wherein the polypeptide comprises 
an amino acid sequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 
25 95%, 96%, 97%, 98%, 99% or more identical to SEQ ID NO:2, SEQ ID NO:5, SEQ ID 

NO:8, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NO: 17, SEQ ID NO:20, SEQ ID NO:23, or 
SEQ ID NO:26 (e.g. , to the entire length of SEQ IDNQ:2, SEQ ID NO:5, SEQ ID NO:8, 
SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ 
IDNO-.26). 

30 An isolated nucleic acid molecule encoding a 8099, 46455, 54414, 53763, 67076, 

67102, 44181, 67084FL, or 67084alt polypeptide identical to the polypeptide of SEQ ID 
NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 1 1, SEQ ID NO:l4, SEQ ID NO:17, SEQ 
ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, can be created by introducing one or more 
nucleotide substitutions, additions or deletions into the nucleotide sequence of SEQ ID NO: 1 , 

35 SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, 
SEQ ID NO: 12, SEQ ID NO:13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID 
NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or 
the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as Accession 
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Number __, _ — , _ } ^ or . such that one or more amino acid 

substitutions, additions or deletions are introduced' into the encoded polypeptide. Mutations 
can be introduced into SEQ ID NO:l„ SEQID NO;3, SEQ ip NO:4, SEQ ID NO:6, SEQ ID 
NO:7i SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:12; SEQ ID NO:13, SEQ ID NO:15, SEQ 
ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID ' 
NO:24, SEQ ID NO:25, SEQ ID NO:27, or the nucleotide sequence of the DNA insert of the 
plasmid deposited with ATCC as Accession Number . " ; 



or 



by standard techniques, such as site-directed mutagenesis and PCR-mediated 



mutagenesis. Preferably, conservative amino acid substitutions are made at one or more 
10 predicted non-essential amino acid residues. A "conservative amino acid substitution" is one 
, in which the amino acid residue is replaced with an amino acid residue having a similar side 
chain. Families-of amino acid residues having similar side chains have been defined in the 
art. These families include amino acids with basic side chains (e.g., lysine, arginine, 
histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains 
15 (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine, tryptophan), 
nonpolar side chains (e.g., alanine, valine, leucine, isoleucine' proline, phenylalanine, ' 
methionine), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side 
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential 
amino acid residue in a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
20 67084alt polypeptide is preferably replaced with another amino acid residue from the same 
side chain family. Alternatively, in another embodiment, mutations can be introduced 
randomly along all or part of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt coding sequence, such as by saturation mutagenesis, and the resultant mutants ' 
can be screened for 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
25 biological activity to identify mutants that retain activity. Following mutagenesis of SEQ ID 
NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO: 6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO.10, SEQIDNO:12, SEQIDNO.13, SEQIDNO-15, SEQID N0.16, SEQIDNO.18, 
SEQ ID NO.19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:27, or the nucleotide sequence of the DNA insert of the plasmid deposited with ATCC as 

30 Accession Number ■ 

— ' ' -' ' » » , , or 

_, the encoded polypeptide can be expressed recombinantly and the activity of the 



35 



polypeptide can be determined. 

In a preferred embodiment, a mutant 8099 and/or 46455 polypeptide can be assayed 
for the ability to (1) bind a monosaccharide, e.g. , D-glucose, D-fructose, D-galactose, and/or 
mannose, (2) transport monosaccharides across a cell membrane, (3) influence insulin and/or 
glucagon secretion, (4) maintain sugar homeostasis in a cell, (5) function as a neuronal 
transporter, and (6) mediate trans-epithelial movement in a cell. 
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In another preferred embodiment, a mutant 54414 and/or 53763 protein can be 
assayed for the ability to ^interact with a 54414 and/or 53763 fcubstrate'(e.g., a potassium 
ion or a cyclic nucleotide); (ii) conduct or transport a,54414 and/or 5376} substrate across a 
cellular mejribrane; (iii) interact with a second non-54414 and/or 53763 protein {e.g., a 
5 54414 and/or 53763 polypeptide or a 54414 and/or 53765 -potassium channel subunit); (iv) 
modulate (e.g., maintain and/or rectify) membrane potentials; (v) regulate target molecule 
availability or activity; (vi) modulate intra^ or intercellular signaling and/or gene 
transcription (e.g., either clireqtly or indirectly); (viii) generate outwardly rectifying' Currents; 
(viii) modulate membrane excitability; (ix) modulate the release of neurotransmitters; (x) 
1 0 regulate contractility fe.g., of smooth muscle cells), secretion, and/or synaptic transmission; 
and/or (xi) modulate processes which underlie learning and memory. 

In a further preferred embodiment, a mutant 544 1 4 protein can be assayed for the 
ability to (i) interact with maxi-K potassium channels (i.e., large conductance channels, in 
particular Slo)\ (ii) modulate maxi-K potassivm channel activity (e.g., S/o-mediated 
15 activities); (iii) generate intermediate conductance channels; and/or (iv) regulate 

contractility (e.g., of smooth muscle cells), secretion, and/or synaptic transmission, in 
particular, via modulation of Slo. ■ ■ 

In still a further preferred embodiment, a mutant 53763 protein can be assayed for 
the ability to (i) interact with Shaker (Sh) potassium channels and/or channel subunits; (ii) 
20 modulate Shaker (Sh) potassium channel activity (e.g., termination of prolonged membrane 
depolarization); and/or (iii) modulation of high voltage activating channel' activity and/or 
inactivating channel activity, and the like. 

In yet another preferred embodiment, a mutant 67076, 67102, 44181, 67084FL, 
and/or 67084alt polypeptide can be assayed for the ability to (i) interact with a 67076, 
25 67 1 02, 44181, 67084FL, or 67084alt substrate or target molecule (e.g., a phospholipid, 

ATP, or a non-67076, 67102, 44181, 67084FL, or 67084alt protein); (ii) transport a 67076, 
67102, 44181, 67084FL, or 67084alt substrate or target molecule (e.g., an 
aminophospholipid such as phosphatidylserine or phosphatidylethanolamine) from one side 
of a cellular membrane to the other; (iii) be phosphorylated or dephosphorylated; (iv) adopt 
30 an El conformation or an E2 conformation; (v) convert a 67076, 67102, 44181, 67084FL, or 
67084alt substrate or target molecule to a product (e.g. , hydrolysis of ATP); (yi) interact 
with a second non-67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt protein; (vii) modulate 
substrate or target molecule location (e.g., modulation of phospholipid location within a cell 
and/or location with respect to a cellular membrane); (viii) maintain aminophospholipid 
35 gradients; (ix) modulate intra- or intercellular signaling and/or gene transcription (e.g. , 
either directly or indirectly); and/or (x) modulate cellular proliferation, growth, 
differentiation, apoptosis, absorption, or secretion. 
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In addition to the nucleic acid molecules encoding 8099, 46455, 54414, 53763, 
67076, 671 02, 44 1 8 1 67084FL',' or 67084alt polypeptides described above, another aspect 
of the invention pertains to isqlated nucleic acid molecules which are antisense thereto. In 
an exemplary embodiment, the invention provides an isolated nucleic acid molecule which 
is antisense to a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
nucleic acid molecule (e.g., is antisense to the coding strand of a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleic acid molecule). An "antisense" 
nucleic acid comprises a nucleotide sequence which 'is complementary to a "sense" nucleic 
acid encoding a polypeptide, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. Accordingly, an antisense 
nucleic acid can hydrogen bond to a sense nucleic acid, the antisense nucleic acid can be 
complementary to an entire 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt coding strand, or to only a portion thereof. In one embodiment, an antisense 
nucleic acid molecule is antisense to a " coding region" of the coding strand of a nucleotide 
15 sequence encoding 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt. The term "coding region" refers to the region of me nucleotide sequence 
comprising codons which are translated into amino acid residues (e.g., the coding region of 

human 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
corresponds to SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:12, and SEQ ID 
20 NO: 15, SEQ ID NO: 18, SEQ ID NO:21, SEQ ID NO:24, and SEQ ID NO:27, respectively). 
In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding 
region" of the coding strand of a nucleotide sequence encoding 8099, 46455, 54414, 53763 
67076, 67102, 44181, 67084FL, or 67084alt . The term "noncoding region"'refers to 5' and 
3' sequences which flank the coding region that are not translated into amino acids (/. e. , also 
25 referred to as 5' and 3* untranslated regions). 

Given the coding strand sequences encoding 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt disclosed herein (e.g., SEQ ID NO:3, SEQ ID NO:6, 
SEQ ID NO:9, SEQ ID NO: 12, and SEQ ID NO:15, SEQ ID NO:18, SEQ ID NO:21, SEQ 
. ID NO:24, and SEQ ID NO:27), antisense nucleic acids of the invention can be designed 
according to the rules of Watson and Crick base pairing. The antisense nucleic acid molecule 
can be complementary to the entire coding region of 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt mRNA, but more preferably is an oligonucleotide' 
which is antisense to only a portion of the coding or noncoding region of 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA. For example, the 
antisense oligonucleotide can be complementary to the region surrounding the translation 
start site of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA 
(e.g, between the -10 and +10 regions of the start site of a gene nucleotide sequence). An 
antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 
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nucleotides in length. An antisense nucleic acid of the invention can be constructed using 
chemical synthesis and enzymatic ligation reaction's using procedures known in the art. For 
example, an antisense nucleic acid {e^g:, an antisense Oligonucleotide) can be chemically 
synthesized using naturally occurring nucleotides or variously modified nucleotides designed 
5 to increase the biological stability of the molecules or to increase the physical stability of the 
duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. Examples of modified 
nucleotides which can be used to generate the antisense nucleic acid include 5-fluorouracil, 
5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 4-acetylcytbsine, 5- 
10 (carboxyhydroxylmethyl) uraeil, 5-carboxymemylaminomemyl-2-thiouridine, 5- 

. carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-» 
isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
' ' methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
15 mannosylqueosine, 5-methoxycarboxymethyluracil, 5 -methoxy uracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3- 
N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense 
20 nucleic acid can be produced biologically using an expression vector into which a nucleic 
acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted 
nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described 
further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
25 subject or generated in situ such that they hybridize with or bind to cellular mRNA and/Or 
genomic DNA encoding a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
. 67084alt polypeptide to thereby inhibit expression of the polypeptide, e.g., by inhibiting 
transcription and/or translation. The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic 
30 acid molecule which binds to DNA duplexes, through specific interactions in the major 
groove of the double helix. An example of a route of administration of antisense nucleic 
' acid molecules of the invention include direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then 
administered systemically. For example, for systemic administration, antisense molecules 
35 can be modified such that they specifically bind to receptors or antigens expressed on a 
selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or 
antibodies which bind to cell surface receptors or antigens. The antisense nucleic acid 
molecules can also be delivered to cells using the vectors described herein. To achieve 
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sufficient intra-cellular concentrations of the antisense molecules, vector constructs in which 
the antisense nucleic acid moledule is placed under the control of a strong-pol II or pol III 
promoter are preferred; , ' ' 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
5 a-anomeric nucleic acid molecule. An a-anomeric nucleic acid moleculeforms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual p-units 
the strands run parallel to each other (Gaultiere/ al. (1987) Nucleic Acids. Res: 15:6625- ' 
6641). The antisense nucleic acid molecule can also' comprise a 2'-o-methylribonucIeotide 
(Inoue e/a/. (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue 
10 (Inoue etah (1987) FEBS Lett. 215:327-330). ' 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme 
Ribozymes are eatalytic RNA.molecules with ribonuclease activity which are capable of 
v cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in 
Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, o'r 67084alt mRNA transcripts 

54414, 53763, 67076, 67102, 44181, 67084FL 
or 67084alt mRNA. A ribozyme having specificity for a 8099-, 46455-, 54414. 53763- 
67076-, 67102-, 44181-, 67084FL-, or 67084alt -encoding nucleic acid can be designed ' 
based upon the nucleotide sequence of a 8099, 46455, 54414, 53763, 67076, 67102 44181 
67084FL, or 67084alt cDNA disclosed herein (i.e., SEQ ID NO.l, SEQ ID N03 SEQ ID ' 
NO:4, SEQ ID NO:6, SEQ IDNO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID N012 SEQ 
ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID N019 SEQ ID 
NO:2I, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or 'the nucleotide 
sequence of the DNA insert of the plasmid deposited with ATCC as Accession Number 

— — — • ' » — — ' Dr : )• Fpr example, a derivative of a 

Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence to be cleaved in a 8099- 46455- 
54414-, 53763-, 67076-, 67102-, 44181-', 67084FL-, or 67084alt -encoding mRNA. See' 
e.g., Cech etal. U.S. Patent No. 4,987,071; and Cech et al U.S. Patent No 5 116 742 
Alternatively, 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
mRNA can be used to select a catalytic RNA having a specific ribonuclease activity from a 

pool of RNA molecules. See, e.g., Bartel, D. and Szostak, J.W. (1993) Science 261 141 1- 
1418 



20 



35 



Alternatively, 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt gene expression can be inhibited by targeting nucleotide sequences 
complementary to the regulatory region of the 8099, 46455, 54414, 53763, 67076 67102 
44181, 67084FL, or 67084alt (e.g., the 8099, 46455, 54414, 53763, 67076, 67102 44181 
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67084FL, or 67084alt promoter and/or enhancers) to form triple helical structures that 
, prevent transcription of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or, 
67084alt gene in target cells. See generally, Helene, G. (\99l) Anticancer Drug Des. 
6(6):569-84; Helene, C. et al. (1992) Ann. N. Y. Acad. &/. 660:27-36; and Maher, L.J. 
5 (1992) Bioassays 14(12):807-15. 

In yet another embodiment the. 8099, 46455, 54414, 53763, 67076, 67102, 441 81 , 
67084FL, or 67084alt nucleic acid molecules of the present invention can be modified at 
the base moiety, sugar moiety or phosphate backbone to improve, e.g. , the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 
10 backbone of the nucleic acid molecules can be modified to generate peptide nucleic acids 
, (see Hyrup. B. et al. (1996) Bioorganic & Medicinal Chemistry 4 (1): 5-23). As used herein, 
the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, 
in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and 
' ' only the four natural nucleobases are retained. The neutral backbone of PNAs has been 
15 shown to allow for specific hybridization to DNA and RNA under conditions of low ionic 
strength. The synthesis of PNA oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup B. et al. (1996) supra; Perry-O'Keefe et 
al. Proc. Natl. Acad. Set 93: 14670-675. 

PNAs of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
20 nucleic acid molecules can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, for example, inducing transcription or translation arrest or inhibiting 
replication. PNAs of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt nucleic acid molecules can also be used in the analysis of single base pair 
25 mutations in a gene, (e.g., by PNA-directed PCR clamping); as 'artificial restriction 
enzymes' when used in combination with other enzymes, (.e.g. , SI nucleases (Hyrup B. 
(1996) supra)); or as probes of primers for DNA sequencing or hybridization (Hyrup B. et 
al. (1996) supra; Perry-O'Keefe supra). 

In another embodiment, PNAs of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
30 67084FL, or 67084alt can be modified, (e.g. , to enhance their stability or cellular uptake), 
by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA 
" chimeras, or by the use of liposomes or other techniques of drug delivery known in the art. 
For,example, PNA-DNA chimeras of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt nucleic acid molecules can be generated which may combine the 
35 advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 

enzymes, (e.g., RNase H and DNA polymerases), to interact with the DNA portion while the 
PNA portion would provide high binding affinity and specificity. PNA-DNA chimeras can 
be linked using linkers of appropriate lengths selected in terms of base stacking, number of 
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bonds between the nucleobases, and orientation (Hyrup B. (1996) supra). The synthesis of 
PNA-DNA chimeras can be performed as described in Hyrup B. (1996) supra and Finn P J 
etal. {1996) Nucleic Acids Res. 24 (1,7): 3357-63. , For examples DNA chain can be . ' 
synthesized on a solid support using standard phosphoramidite coupling chemistry and' 
5 modified nucleoside analogs, e.g., 5'-(4-methoxytrityl) amino-S'-deoxy-thymidine 

phosphoramidite, can be used as a between the PNA and the 5' end of DNA (Mag, M et al. 
(1989) Nucleic Acid Res. 17: 5973-88). PNA monomers are then coupled in a stepwise 
manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn 
P.J. et al. (1996) supra). Alternatively, chimeric molecules can be synthesized with a 5' 
10 DNA segment and a 3' PNA segment (Peterser, K.H. et al (1975) Bioorganic Med Chem 
i Lett. 5: 1119-11124). 

In other embodiments, the oligonucleotide may include other appended groups such 
as peptides (eg., for targeting host cell receptors in vivo), or agents facilitating transport 
across the cell membrane (see, e.g., Letsinger et al. (1989) Proc. Natl. Acad. Sci. USA 
15 86:6553-6556; Lemaitre et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCT 
Publication No. W088/09810) or the blood-brain barrier (see ; e.g. , PCT Publication No. 
W089/10134). In addition, oligonucleotides can be modified with hybridization-triggered 
cleavage agents (See, e.g., Krol et al. (1988) Bio-Techniques 6:958-976) or intercalating 
agents. (See, e.g., Zon (1988) Pharm. Res. 5:539-549). To this end, the oligonucleotide 
may be conjugated to another molecule, (e.g., a peptide, hybridization triggered cross- 
linking agent, transport agent, or hybridization-triggered cleavage agent). 

Alternati vely, the expression characteristics of an endogenous 8099, 46455 544 1 4 
53763, 67076, 67102, 44181, 67084FL, or 67084alt gene within a cell line or 
microorganism may be modified by inserting a heterologous DNA regulatory element into 
the genome of a stable cell line or cloned microorganism such that the inserted regulatory 
element is operatively linked with the endogenous 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt gene. For example, an endogenous 8099 46455 

54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene which is normally ' 
"transcriptionally silent", ie., a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

67084FL,or67084alt gene which is normally not expressed, or is expressed only at very 
low levels in a cell line or microorganism, may be activated by inserting a regulatory 
' element which is capable of promoting the expression of a normally expressed gene product 
in that cell line or microorganism. Alternatively, a transcriptionally silent, endogenous 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene may be 
activated by insertion of a promiscuous regulatory element that works across cell types. 

A heterologous regulatory element may be inserted into a stable cell line or cloned 
microorganism, such that it is operatively linked with an endogenous 8099 46455 54414 
53763, 67076, 67102, 44181, 67084FL, or 67084alt gene, using techniques, such as 
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targeted homologous recombination, which are well known to those' of skill in the art, and 
described, e.g., in Chappel/U.Sv Patent No. '5,272,071; PCT publication No. WO 91/06667, 
published May 16„ 199,1. , , r 

."if . 

5 II. Isolated 8099, 46455, 54414, 537.63, 67076, 67102, 44181. 67084FL, or 67084alt 
Polypeptides and Anti-8099, 46455. 54414, 53763, 67076, 67102, 44181, 67084FL, or 

67084alt Antibodies 

One aspect of the invention pertains to isolated 8099, 46455,- 54414, 53763," 670,76, 
67102, 44181, 67084FL, or 67084alt or recombinant polypeptides and polypeptides, and 

10 biologically active portions thereof, as well as polypeptide fragments suitable for use as 
immunogens to raise anti-8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt. antibodies. In one embodiment, native 8099, 46455, 54414, 53763, 67076, 67102, 
44181 , 67084FL, or 67084alt polypeptides can be isolated from cells or tissue sources by an 
appropriate purification scheme using standard protein purification techniques. In another 

15 embodiment, 8099, 46455, 544^4, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptides are produced by recombinant DNA techniques. Alternative to. recombinant 
expression, a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide or polypeptide can be synthesized chemically using standard peptide synthesis 
techniques. 

20 An "isolated" or "purified" polypeptide or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or tissue 
source from which the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide is derived, or substantially free from chemical precursors or other 
chemicals when chemically synthesized. The. language "substantially free of cellular 
25 material" includes preparations of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide in which the polypeptide is separated from cellular 
components of the cells from which it is isolated or recombinantly produced. In one 
embodiment, the language "substantially free of cellular material" includes preparations of 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide 
30 having less than about 30% (by dry weight) of non-8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide (also referred to herein as a 
"contaminating protein"), more preferably less than about 20% of non-8099, 46455, 544 14, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide, still more preferably less 
than about 10% of non-8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
35 67084alt polypeptide, and most preferably less than about 5% non-8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide. When the 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or biologically 
active portion thereof is recombinantly produced, it is also preferably substantially free of 
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culture medium, i.e., culture medium represents less than about 20%; more preferably less 
than about 10%, and most preferably less than about 5% of the volume of the protein 
preparation. , ( 

The.language "substantially free of chemical precursors or other chemicals" includes 
S preparations of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypept,de in which the polypeptide is separated from chemical precursors or other 
chemicals which are involved in the synthesis of the polypeptide. In one embodiment; the 
language "substantially free of chemical-precursors or other chemicals" includes ' ' 
preparations of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084al't 
»■ polypep tl de having less than about 30% (by dry weight) of chemical precursors or non- 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67Q84FL, or 67084alt chemicals more 
preferably less than about 20% chemical precursors or non-8099, 46455 54414 53 76 3 
67076, 67102, 44181, 67084FL, or 67084alt chemicals, still more preferably less than' 'about 
10/o chemical precursors or non-8099, 46455, 54414, 53763, 67076, 67102, 44181 
67084FL, or 67084alt chemicals, and most preferably less than about 5% chemical' 
precursors or non-8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
chemicals. 

As used herein, a "biologically active portion" of a 8099, 46455 54414 53763 
67076, 67102, 44181, 67084FL, or 67084alt polypeptide includes a fragment of a 8099 ' 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide which ' 
participates m an interaction between a 8099; 46455, 54414, 53763, 67076, 67102, 44181 
67084FL, or 67084alt molecule and a non-8099, 46455, 54414, 53763, 67076 67102 
44181, 67084FL, or 67084alt molecule (e.g., a 8099, 46455, 54414, 53763, 67076 67102 
44181, 67084FL, or 67084alt substrate). Biologically active portions of a 8099 46455 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide include' peptides 
comprising amino acid sequences sufficiently identical to or derived from the amino acid 
sequence of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide, e.g., the amino acid sequence shown in SEQ ID NO:2, SEQ ID N0 5 SEQ ID 
NO:8, SEQ ID NO:l 1, SEQ ID NO:l4, SEQ ID NO: 17, SEQ ID NO:20, SEQ ID NO:23, or 
SEQ ID NO.-26, which include less amino acids than the full length 8099 46455 54414 ' 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides, and exnibit at least one 
activity of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide. 

Typically, biologically active portions of a 8099 or 46455 polypeptide comprise a 
domain or motif with at least one activity of the 8099 or 46455 polypeptide, e.g, modulating 
sugar transport mechanisms. A biologically active portion of an 8099 polypeptide can be a 
polypeptide which is, for example, 25, 30, 35, 40, 45, 50, 75, 100, 125, 150, 175 200 225 

250, 275, 300, 325, 350, 375 400 425 4SO 47S snn ^ ' . ' ' ' 

» "^^> > 4 /5j 500, 525, 550, 600 or more amino acids in 
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length A biologically active portion of ah 46455 polypeptide can be a polypeptide which is, 
for example, 25, 30, 35, 40, 45, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, , 
350, 375, 400, 425, 450, 475, 500, 525 or more amino acids, in length. Biologically active 
portions of an 8099 and/or an 46455 polypeptide can be used as targets for developing 
agents which modulate an 8099 or 46455 mediated activity, e.g., a sugar transport 
mechanism. 

In one embodiment, a biologically active portion of an 8099 or an 46455 polypeptide 
comprises at least one transmembrane domain. lit is to be understood that a preferred 
biologically active portion of an 8099 or an 46455 polypeptide of the present invention 
comprises at least one or more of the following domains: a transmembrane domain and/or a 
sugar transporter family domain. Moreover, other biologically active portions, in which ' 
other regions of the polypeptide are deleted, can be prepared by recombinant techniques and 
evaluated for one or more of the functional activities of a native 8099 or 46455 polypeptide. 

Moreover, biologically active portions of a 544 1 4 or 53763 polypeptide comprise a 
domain or motif with at least one activity of the 54414 or 53763 polypeptide, e.g., 
modulation of intra- or inter-cellular signaling and/or gene expression, and/or modulate 
membrane excitability. A biologically active portion of a 54414 or 53763 polypeptide can 
be a polypeptide which is, for example, 10, 25, 50, 75, 100, 125, 150 or more amino acids in 
length. Biologically active portions of a 54414 or 53763 polypeptide can be used as targets 
for developing agents which modulate a 54414 or 53763 mediated activity, e.g., modulation 
of intra- or inter-cellular signaling and/or gene expression, and/or modulate membrane 
excitability. 

In one embodiment, a biologically active portion of a 54414 or 53763 polypeptide 
comprises at least one transmembrane domain and/or a pore domain. Moreover, other 
biologically active portions, in which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one or more of the functional 
. activities of a native 5441 4 or 53763 polypeptide. 

Biologically active portions of a 67076, 67102, 44181, 67084FL, or 67084alt 

polypeptide comprise a domain or motif with at least one activity of the 67076, 67102, 
44181, 67084FL, or 67084alt polypeptide, e.g., the ability to interact with a 67076, 67102, 
441 81, 67084FL, or 67084alt substrate or target molecule (e.g., a phospholipid; ATP; a non- 
' 67076, 67102, 441 81, 67084FL, or 67084alt protein; or another 67076, 67102, 44181, 
67084FL, or 67084alt protein or subunit); the ability to transport a 67076, 67102, 44181, 
67084FL, or 67084alt substrate or target molecule {e.g. , a phospholipid) from one side of a 
i cellular membrane to the other; the ability to be phosphorylated or dephosphorylated; the 
ability to adopt an El conformation or an E2 conformation; the ability to convert a 67076, 
67102, 44181, 67084FL, or 67084alt substrate or target molecule to a product (e.g., the 
ability to hydrolyze ATP); the ability to interact with a second non-67076, 67102, 44181, 
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67084FL, or 67.084alt protein; the ability to modulate intra- or inter-cellular signaling anaVor 
gene transcription (e.g. , either directly or indirectly); the ability to modulate cellular growth, 
proliferation, differentiation, absorption, and/or secretion. A biologically active portion of a 
6707(9, 67102, 441 81, 67084FL, or 67084alt polypeptide can be a polypeptide which is, for 
5 example, 10, 25, 50, 75, 100, 125, 150, 175, 200, 250; 300, 350, 400, 450, 500, 550, 600, 
650, 700, 750, 800, 850, 900, 950, 1000, 1050, 1 100, 1 150, 1200, 1250, 1300, 1350, 1400, 
1 450, 1 500, 1 550 or more amino acids in length. Biologically active portions of a 67076, 
67102, 441 81 , 67084FL, or 67084alt polypeptide can be used as targets for developing 

agents which modulate a 67076, 67102, 44181, 67084FL, or 67084alt mediated activity, 
10 e.g., modulating transport of biological molecules ; across membranes. 

i In. one embodiment, a biologically active portion of a 67076, 67102, 44181, 1 

67084FL, or 67084alt polypeptide comprises at least one at least one or more of the 
following domains, sites, or motifs: a transmembrane domain, an N-terminal large 
extramembrane domain, a C-terminal large extramembrane domain, an El -E2ATPases 

1 5 phosphorylation site, a P-type ATPase sequence 1 motif, a P-type ATPase sequence 2 motif, 
a P-type ATPase sequence 3 motif, and/or one or more phospholipid transporter specific 
amino acid resides. Moreover, other biologically active portions, in which other regions of 
the polypeptide are deleted, can be prepared by recombinant techniques and evaluated for 
one or more of the functional activities of a native 67076, 67 1 02, 44 1 8 1 , 67084FL, or 

20 67084alt polypeptide. 

Another aspect of the invention features fragments of the polypeptide having the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO.l 1, SEQ 
ID NO: 14, SEQ ID NO- 1 7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, for 
example, for use as immunogens. In one embodiment, a fragment comprises at least 5 

25 amino acids (e.g., contiguous or consecutive amino acids) of the amino acid sequence of 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID 
NO: 1 7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, or an amino acid sequence 
encoded by the DNA insert of the plasmid deposited with the ATCC as Accession Number 
> » ■ ■ - or . In another embodiment, a fragment 

30 comprises at least 10, 15, 20, 25, 30, 35, 40, 45, 50 or more amino acids (e.g., contiguous or 
consecutive amino acids) of the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ 
IDNO:8, SEQ IDNOrll, SEQ ID NO: 14, SEQ IDNO:17, SEQ IDNO:20, SEQ ID NO:23, 
or SEQ ID NO :26, or an amino acid sequence encoded by the DNA insert of the plasmid 
deposited with the ATCC as Accession Number - . ' , __, or 

35 . 

In a preferred embodiment, a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide has an amino acid sequence shown in SEQ ID NO:2, 
SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID NO:17, SEQ ID 
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NO:20, SEQ ID NO:23, or SEQ ID NO:26. In other embodiments, the 8099, 46455 v 54414, 
53763, 67076, 671 02j 441 81, 67084FL, or 67084alt polypeptide is substantially identical to 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID ^P:14, .SEQ ID 
NO: 17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO^ and retains the functional 
5 activity of the polypeptide of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, 
SEQIDNO:14, SEQIDNO:17,SEQ IDNO:20,SEQIDNO:23,orSEQlDNO:26,yet 
differs in amino acid sequence due to natural allelic variation or mutagenesis, as described in 
detail in subsection I above. Ip another 'embodiment, the 8099, 46455, 54414, 53763, . 
67076, 67102, 44181, 67084FL, or 67084alt polypeptide is a polypeptide which comprises 
10 an amino acid sequence at least about 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, 99% or more identical to SEQ ID NO:2, SEQ ID NO:5, SEQ ID 
NO:8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or 
SEQ ID NO:26. 

In another embodiment, the invention features a 8099, 46455, 54414, 53763, 67076, 
15 67102, 44181, 67084FL, or 67Q84alt polypeptide which is encoded by a nucleic acid 
molecule consisting of a nucleotide sequence at least about 50%, 55%, 60%, 65%, 70%, 
75%, 80%, 85%, 90%; 95%, 96%, 97%, 98%, 99% or more identical to a nucleotide 
sequence of SEQ ID NO: 1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, 
SEQ ID NO:?, SEQ ID NO:10, SEQ ID NO: 12, SEQ ID NO:13, SEQ ID NO:15, SEQ It) 
20 NO: 16, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, 
SEQ ID NO:25, SEQ ID NO:27, or a complfement thereof. This invention further features a 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide 
which is encoded by a nucleic acid molecule consisting of a nucleotide sequence which 
hybridizes under stringent hybridization conditions to a complement of a nucleic acid 
25 molecule comprising the nucleotide sequence of SEQ ID NO:l , SEQ ID NO:3, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ 
ID NO:13, SEQ ID NO: 15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or a complement 
thereof 

30 To determine the percent identity of two amino acid sequences or of two nucleic acid 

sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in one or both of a first and a second amino acid or nucleic acid sequence for 
optimal alignment and non-identical sequences can be disregarded for comparison 
purposes). In a preferred embodiment, the length of a reference sequence aligned for 

35 comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, 
even more preferably at least 60%, and even more preferably at least 70%, 80%, or 90% of 
the length of the reference sequence (e.g. , when aligning a second sequence to the 8099 
amino acid sequence of SEQ ID NO:2 having 617 amino acid residues, at least 1 85, 
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preferably at least 246, more preferably at least 308, more preferably at least 370, even more 
preferably at least 43 1 , and even more preferably at least 493 or 555 or more amino acid . 
residues are aligned. In another preferred embodiment, the sequences being aligned for 
comparison purposes are globally aligned and percent identity is determined over the entire 
5 length of the sequences aligned . The amino acid residues or nucleotides at corresponding 
amino acid positions or nucleotide positions are then compared. When a position in the first 
sequence is occupied by the same amino' acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position (as used 
herein amino acid or nucleic acid "identity 11 is equivalent to amino acid or nucleic acid 
1 0 "homology"). The percent identity between the two/sequences is a function of the number 
• of identical positions shared by the sequences, taking into account the number of gaps, and 
the length of each gap, which need to be introduced for optimal alignment of the two 
sequences. 

The comparison of sequences and determination of percent identity between two 

15 sequences can be accomplished using a mathematical algorithm. In a preferred 

embodiment, the percent identity between two amino acid sequences is determined using the 
Needleman and Wunsch (J. Mol Biol (48):444-453 (1970)) algorithm which has been 
incorporated into the GAP program in the GCG software package (available at the Accelrys 
website), using either a Blosum 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 

20 12, 1 0, 8, 6, or 4 and a length weight of 1 , 2, 3, 4, 5, or 6. In yet another preferred 

embodiment, the percent identity between two nucleotide sequences is determined using the 
GAP program in the GCG software package, using a NWSgapdna.CMP matrix and a gap 
weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A preferred, non- 
limiting example of parameters to be used in conjunction with the GAP program include a 

25 Blosum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a 
frameshift gap penalty of 5.- 

In another embodiment,' the percent identity between two amino acid or nucleotide 
sequences is determined using the algorithm of E. Meyers and W. Miller (CompuL Appl 
Biosci., 4:11-17 (1988)) which has been incorporated into the ALIGN prqgram (version 2.0 

30 or version 2.0U), using a PAM120 weight residue table, a gap length penalty of 12 and a gap 
penalty of 4. 

The nucleic acid and polypeptide sequences of the present invention can further be 
used as a "query sequence" to perform a search against public databases to, for example, 
identify other family members or related sequences. Such searches can be performed using 
35 the NBLAST and XBLAST programs (version 2.0) of Altschul, et al (1 990) J. Mol Biol 
215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, 
score = 1 00, wordlength = 12 to obtain nucleotide sequences homologous to 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleic acid molecules of the 
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invention. a BLAST protein searches can be performed with the XBLAST program, score = 
100, wordlength = 3, and a Blosum62 matrix to obtain amino acid sequences homologous to 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt r polypeptide 
molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped 
5 BLAST can be utilized as described in Altschul et at., (1997) Nucleic Acids Res. 

25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default 
parameters of the respective programs (e.g. , XBLAST and NBLAST) can be used. See the 
National Center for Biotechnology website. ', 

The invention also provides 8099, 46455, 54414, 53763, 67076, 67102, 44181, ' 

10 67084FL, or 67084alt chimeric or fusion proteins. As used herein, a 8099, 46455, 54414, 
53763, 67076, 671Q2,'44181, 67084FL, or 67084alt "chimeric protein" or "fusion protein" 
comprises a 8099, 46455, 54414, 53763, 67076, 67102,' 441 81, 67084FL, or 67084alt 
polypeptide operatively linked to a non-8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide. A "8099 f 46455, 54414, 53763, 67076, 67102, 44181, 

15 67084FL, or 67084alt polypeptide" refers to a polypeptide having an amino acid sequence 
corresponding to a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide whereas a "non-8099, 46455, 54414, 53763, 67076„ 67102, 44181, 67084FL, or 
67084alt polypeptide" refers to a polypeptide having an amino acid sequence corresponding 
to a polypeptide which is not substantially homologous to the 8099, 46455, 54414, 53763, 

20 67076, 67102, 44181, 67084FL, or 67084alt polypeptides, respectively, e.g., a polypeptide 
which is different from the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide and which is derived from the same or a different organism. Within a 
.8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt fusion protein the 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide can 

25 correspond to all or a portion of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide, in a preferred embodiment, a 8099, 46455, 54414, 
53763, 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt fusion protein comprises at least one 
biologically active portion of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide. In another preferred embodiment, a 8099, 46455, 54414, 53763, 

30 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt fusion protein comprises at least two 
biologically active portions of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide. Within the fusion protein, the term "operatively linked" 
is intended to indicate that the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide and the non-8099, 46455, 54414, 53763, 67076, 67102, 44181, 

35 67084FL, or 67084alt polypeptide are fused in-frame to each other. The non-8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide can be fused to the 
N-terminus or C-terminus of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide. 
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For example, in one embodiment, the fusion protein is a GST-8099, -46455, -54414, 
-53763, -67076, -67 1 02, -441 8 V, -67084FL; or -67084alt fusion protein iri which the 8099, 
46455, 54414, 53763, 67076„67102, 44181, 67084FL, or 67084alt sequences are fused to 
the C-termuius of the GST sequences. Such fusion proteins can facilitate the purification of 
5 recombinant 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . 

In another embodiment, the fusion protein is a 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide containing a heterologous signal sequence 
at its N-terminus. In certain host cells (fe.g., mammalian host cells), expression and/br . 
. secretion of 8099. 46455, 544i4, 53763, 67076, 67102, 44181, 67084FL, or 67084alt can 
10 be increased through the use of a heterologous signal sequence. 

The 8099, 46455, 54414,' 53763, 67076, 67102, 44181, 67084FL, or 67084alt fusion 
proteins of me invention can be incorporated into pharmaceutical compositions and 
administered to a subject in vivo. The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt fusion proteins can be used to affect the bioavailability of a 8099, 
15 46455, 544 14, 53763, 67076, 671 02, 44 181, 67084FL, or 67084alt substrate. Use of 8099, 
46455, 54414, 53763, 67076^ 67102, 44181, 67084FL, or 67084alt fusion proteins may be 
useful therapeutically for the treatment of disorders caused by, for example, (i) aberrant 
modification or mutation of a gene encoding a 8099, 46455, 54414, 53763, 67076, 67102, ' 
44181, 67084FL, or 67084alt polypeptide; (ii) mis-regulation of the 8099, 46455, 54414, 
20 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene; and (hi) aberrant post- 

translational modification of a 8099, 46455, 54414, 53763, 67076, 67102,' 44181, 67084FL, 
or67084alt polypeptide. 

Moreover, the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt -fusion proteins of the invention can be used as immunogens to produce anti-8099, 
25 anti-46455, anti-54414, anti-53763, anti-67076, anti-67102; anti-44181, anti-67084FL, 
and/or anti-67084alt antibodies in a subject, to purify 8.099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt ligands and in screening assays to identify molecules 
which inhibit the interaction with or transport of 8099, 46455, 54414, 53763, 67076, 67102, • 
■i. 44181, 67084FL, or 67084alt with a 8099, 46455, 54414, 53763^ 67076, 67102, 44181, 
30 67084FL, or 67084alt substrate. 

Preferably, a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt chimeric or fusion protein of the invention is produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide 
sequences are ligated together in-frame in accordance with conventional techniques, for 
35 example by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme 
digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, 
alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In 
another embodiment, the fusion gene can be synthesized by conventional techniques 
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including automated DNA synthesizers. Alternatively, PCR amplification of gene 

fragments can be carried out using anchor primers whicji give rise to complementary 

overhangs between two consecutive gene fragments which qan subsequently be annealed 

and reamplified to generate a chimeric gene sequence (see, for example, Current Protocols 

5 in Molecular Biology, eds. Ausubel et ah John Wiley & Sons: 1992). Moreover, many 

expression vectors are commercially available that already encode a fusion moiety (e.g., a 

GST polypeptide). A 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

.? , .. . 

67084alt -encoding nucleic acid can be cloned into such an expression vector such that the 

fusion moiety is linked in-frame to the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

10 67084FL, or 67084alt polypeptide. 

i The present invention also pertains to variants of the 8099, 46455, 54414, 53763, 1 

67076, 67102, 44181, 67084FL, or 67084alt polypeptides which function as either 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt agonists (mimetics) or 

. ' as 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt antagonists. 

15 Variants of the 8099, 46455, 54414, 53763, 67076, 67102, 441 81 , 67084FL, or 67084alt. 
polypeptides can be generated by mutagenesis, e.g., discrete point mutation or truncation of 
a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide. An 
agonist of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptides can retain substantially the same, or a subset, of the biological activities of the 

20 naturally occurring form of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide. An antagonist of a 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt polypeptide can inhibit one or more of the activities of the 
naturally occurring form of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide by, for example, competitively modulating a 8099, 46455, 54414, 

25 53763, 67076, 67102, 441 8 1, 67084FL, or 67084alt -mediated activity of a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide. Thus, specific 
. biological effects can be elicited by treatment with a variant of limited function. In one 
embodiment, treatment of a subject with a variant having a subset of the biological activities 
of the naturally occurring form of the polypeptide has fewer side effects in a subject relative 

30 to treatment with the naturally occurring form of the 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide. 

in one embodiment, variants of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide which function as either 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt agonists (mimetics) or as 8099, 46455, 54414, 

35 53763, 67076, 67102, 44181, 67084FL, or 67084alt antagonists can be identified by 
screening combinatorial libraries of mutants, e.g., truncation mutants, of a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide for 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide agonist or 
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antagonist activity. In one embodiment, a variegated library of 8099, 46455, 54414, 53763, 
67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt variants is generated by combinatorial 
mutagenesis at me nucleic acid level and is encoded by a variegated gene library. A 
variegated library of 8099, 46455, 544 1 4, 53763, 67076, 67 1 02, 441 8 1 , 67084FL, or 
5 67084alt variants can be produced by, for example, enzymatically ligating a mixture of 
synthetic oligonucleotides into gene sequences such that a degenerate set of potential 8099, 
46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt sequences is expressible 
as individual polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage 
display) containing the set of 8099, 46455, 54414, 53763, 67076-67102, 44181, 67084FL, 

10 or 67084alt sequences therein. There are a variety of methods which can be used to 
. produce libraries of potential 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FH 
or 67084alt variants from a degenerate oligonucleotide sequence. Chemical synthesis of a 
degenerate gene sequence can be performed in an automatic DNA synthesizer, and the 
synthetic gene then ligated into an appropriate expression vector. Use of a degenerate set of 

15 genes allows for the provision, in one mixture, of all of the sequences encoding the desired 
set of potential 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
sequences. Methods for synthesizing degenerate oligonucleotides are known in the art (see, 
e.g., Narang, S.A. (1983) Tetrahedron 39:3; Itakura et al. (1 984) Anriu. Rev. Biochem. 
53 :323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1:477 

20 In addition, libraries of fragments of a 8099, 46455, 54414,53763, 67076, 67102, 

44181, 67084FL, or 67084alt polypeptide coding sequence can be used to generate a 
variegated population of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt fragments for screening and subsequent selection of variants of a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide. In one 

25 embodiment, a library of coding sequence fragments can be generated by treating a double 
stranded PCR fragment of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
'67084alt coding sequence with a nuclease under conditions wherein nicking occurs only 
about once per molecule, denaturing the double stranded DNA, renaturing the DNA to form 
double stranded DNA which can include sense/antisense pairs from different nicked 

30 products, removing single stranded portions from reformed duplexes by treatment with SI 
nuclease, and ligating the resulting fragment library into an expression vector. By this 
method, an expression library can be derived which encodes N-terminal, C-terminal and 
internal fragments of various sizes of the 8099, 46455, 54414, 53763, 67076, 671 02, 44181, 
67084FL, or 67084alt polypeptide. 

35 Several techniques are known in the art for screening gene products of combinatorial 

libraries made by point mutations or truncation, and for screening cDNA libraries for gene 
products having a selected property. Such techniques are adaptable for rapid screening of 
the gene libraries generated by the combinatorial mutagenesis of 8099, 46455, 54414, 
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53763, 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt polypeptides. The most widely .used 
techniques, which are amenable to high through-put analysis, for screening large gene 
libraries typically include clonjng the gene library into replicable expressipn vectors, 
transforming appropriate cells with the resulting library of Vectors, and expressing the 

5 combinatorial genes under conditions in which detection of a desired activity facilitates 
isolation of the vector encoding the gene whose product was detected. Recursive ensemble 
mutagenesis (REM), a new technique which enhances the! frequency of functional mutants in 
the libraries, can be used in combination 1 with the screening assays to- identify 8099, 46455, 
54414, 53763, 67Q76, 67102, 44181, 67084FL, or 67084alt variants (Arkin and Yourvan 

10 (1992) Procy Natl Acad. ScL USA 89:781 1-7815; Delgrave et al (1993) Protein 

Engineering 6(3):327 -*331). 

In» one embodiment, cell based assays can be exploited to analyze a variegated 8099, 

46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt library. For example, a 

' i 
library of expression vectors can be transfected into a cell line, which ordinarily responds to 

15 8099, 46455, 54414, 53763, 67Q76, 67102, 44181, 67084FL, or 67084alt in a particular . 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, ot 67084alt substrate- 
dependent manner. The transfected cells are then contacted with 8099, 46455, 544 1 4, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt and the effect of the expression of the 
mutant on signaling by the 8099, 46455, 54414, 53763,67076, 67102, 44181, 67084FL, or 

20 67084alt substrate can be detected, e.g. , phospholipid transport {e.g. , by measuring 
phospholipid levels inside the cell or its varidus cellular compartments, within various 
cellular membranes, or in the extra-cellular medium), hydrolysis of ATP, phosphorylation or 
dephosphorylation of the HEAT protein, and/or gene transcription. Plasmid DNA can then 
be recovered from the cells which score for inhibition, or alternatively, potentiation of 

25 signaling by the HEAT substrate, or which score for increased or decreased levels of 

phospholipid transport or ATP hydrolysis, and the individual clones further characterized. 

An isolated 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide, or a portion or fragment thereof, can be. used as an immunogen to generate 
antibodies that bind 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

30 67084alt using standard techniques for polyclonal and monoclonal antibody preparation. A 
full-length 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide can be used or, alternatively, the invention provides antigenic peptide fragments 
of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt for use as 
immunogens. The antigenic peptide of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

35 67084FL, or 67084alt comprises at least 8 amino acid residues of the amino acid sequence 
shown in SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO:14, 
SEQ ID NO: 17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26 and encompasses an 
epitope of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt such 
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that an antibody raised against the peptide forms a specific immune complex with 8099 
46455, 54414, 53763, 67076, 67102, 44181; 67084FL, or 67084alt . Preferably the ' 
antigemc peptide comprises a t least 10 amino acid residues, more preferably at least 15 
amino acid .residues, even more preferably at least 20 amino acid residues, and most 
5 preferably at least 30 amino acid residues. 

Preferred epitopes encompassed by the antigenic peptide are regions of 8099 46455 
54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt that are located on the surface ' 
of the polypeptide, e.g., hydrophilic regions, as well 'as regions with high antigenicity fsee 
for example, Figures 2, 9, 13, 17,21, 25, 29, 33, and 37). ' 
10 A8099, 46455,54414,53763,67076, 67102, 44181, 67084FL, or 67084alt 

immunogen typically 'is used to prepare antibodies by immunizing a suitable subject (e g. 
rabbrt, goat, mouse or other mammal) with the immunogen. An appropriate immunogenic 
preparation can contain, for example, recombinantly expressed 8099, 46455 54414 53763 

or 67084alt polypeptide. The 
preparation can further include an adjuvant, such as Freund's cbmplete or incomplete • 
adjuvant, or similar immunostimulatory agent. Immunization of a suitable subject with an 
immunogenic 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

preparation induces a polyclonal anti-8099, 46455, 54414, 53763, 67076 67102 44181 
20 67084FL, or 67084alt antibody response. ' ' 

Accordingly, another aspect of the invention pertains to polyclonal anti-8099 46455 

54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt antibodies. The term ' ' 

"antibody" as used herein refers to immunoglobulin molecules and immunologically active 

portions of immunoglobulin molecules, molecules that contain an antigen binding site 

25 which spe C1 ficaIly binds (immunoreacts with) an antigen; such as 8099 46455 54414 

53763, 67076, 67102, 44181, 67084FL, or 67084alt . Examples of immunologically active 
portions of immunoglobulin molecules include F(ab) and F(ab0 2 fragments which c^ be 
generated by treating the antibody with an enzyme such as pepsin. The invention provides 
polyclonal and monoclonal antibodies that bind 8099, 46455, 54414, 53763 67076 67102 

>0 44181, 67084FL, or 67084alt . The term "monoclonal antibody" or "monoclonal antibody ' 
composmon", as used herein, refers to a population of antibody molecules that contain only 
one species of an antigen binding site capable of immunoreacting with a particular epitope 
of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . A monoclonal 
anybody composition thus typically displays a single binding affinity for a particular 8099 

5 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide with which 
it lmmunoreacts. 

Polyclonal anti-8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL or 
67084alt antibodies can be prepared as described above by immunizing a suitable subject 
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with a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
immunogen. The ariti-8099, 46455, 54414, 53763,' 67076, 67102, 44181, 67084FL, or , 
67084alt antibody titer in the immunized subject pan be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using 
5 immobilized 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . If 
desired, the antibody molecules directed against 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt can be isolated from the mammal; (e.g., from the blood) and 
further purified by well known techniques, such as protein A chromatography to obtain the 
IgG fraction. At an appropriate time after immunization, e.g., when the anti-8099, 46455, 
10 54414, 53763, 67076, 67102,44181, 67084FL, or 67084alt antibody titers are highest, 
, antibody-producing cells can be obtained from the subject and used to prepare monoclonal 
antibodies by standard techniques, such as the hybridoma technique originally described by 
Kohler and Milstein (1975) Nature 256:495-497) (see also, Brown et al. (1981) J. Immunol. 
' ' 127:539-46; Brown et al. (1980) J. Biol. Chem .255:4980-83; Yeh et al. (1976) Proc. Natl. 
15 Acad. Sci. USA 76:2927-31; and Yeh et al. (1982) Int. J. Cancer 29:269-75), the more recent 
human B cell hybridoma technique (Kozbor et al. (1983) Immunol Today 4:72), the EBV- 
hybridoma technique (Cole et al. (1 985), Monoclonal Antibodies and Cancer Therapy, Alan 
R. Liss, Inc., pp. 77-96) or trioma techniques. The technology for producing monoclonal 
antibody hybridomas is well known (see generally R. H. Kenneth, in Monoclonal 
20 Antibodies: A New Dimension In Biological Analyses, Plenum Publishing Corp., New York, 
New York (1980); E. A. Lerner (1981) Yale J. Biol. Med., 54:387-402; M. L. Gefter et al. 
(1977) Somatic Cell Genet. 3:231-36). Briefly, an immortal cell line (typically a myeloma) 
is fused to lymphocytes (typically splenocytes) from a mammal immunized with a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt immunogen as 
25 described above, and the culture supernatants of the resulting hybridoma cells are screened 
to identify a hybridoma producing a monoclonal antibody that binds 8099, 46455, 54414, 
53763, 67076, 671 02, 44 1 8 1 , 67084FL, or 67084alt 

Any of the many well known protocols used, for fusing lymphocytes and 
immortalized cell lines can be applied for the purpose of generating an anti-8099, 46455, 
30 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt monoclonal antibody (see, e.g., 
G. Galfre et al (1977) Nature 266:55052; Gefter et al. Somatic Cell Genet., cited supra; 
■ Lerner, Yale J. Biol. Med., cited supra; Kenneth, Monoclonal Antibodies, cited supra). 
Moreover, the ordinarily skilled worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the immortal cell line (e.g., a 
35 myeloma cell line) is derived from me same mammalian species as the lymphocytes. For 

example, murine hybridomas can be made by fusing lymphocytes from a mouse immunized 
with an immunogenic preparation of the present invention with an immortalized mouse cell 
line. Preferred immortal cell lines are mouse myeloma cell lines that are sensitive to culture 
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medium containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma ceil lines cdh be used aS a fusion partner according to'stahdard 
techniques, e.g., the P3-NSl/l r Ag4-l, P3-x63-Ag8.653 or Sp2/0-Ag 14 myeloma lines. 
These myeloma lines are available from ATCC. Typically, HAT-sensitive mouse myeloma 
cells are fused to mouse splenocytes using polyethylene glycol ("PEG"). Hybridoma cells 
resulting from the fusion are then selected using HAT medium, which kills unfused and 
unproductively fused myeloma cells (unfused splenocytefc die after several days because 
they are not transformed). Hybridoma cells producirig a monoclonal-antibody of the ' ■ 
invention are detected by screening the hybridoma culture supernatants for antibodies that 
bind 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt , e.g., using a 
standard ELISA assay. 

Alternative to preparing monoclonal antibody-secreting hybridomas, a monoclonal 
anti-8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt antibody can 
be identified and isolated by screening a recombinant combinatorial immunoglobulin library 
(e.g., an antibody phage display library) with 8099, 46455, 54414, 53763, 67076, 67102, . 
44181, 67084FL, or 67084alt to thereby isolate immunoglobulin library members that bind 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . Kits for 
generating and screening phage display libraries are commercially available (e.g., the 
Pharmacia Recombinant Phage Antibody System, Catalog No. 27-9400-0 1 ; and the 
Stratagene SurJZAP™ Phage Display Kit, Catalog No. 240612). Additionally, examples of 
methods and reagents particularly amenable for use in generating and screening antibody 
display library can be found in, for example, Ladner et ah U.S. Patent No. 5,223,409; Kang 
et al, PCT International Publication No. WO 92/18619; Dower etal. PCT International 
Publication No. WO 91/17271; Winter etal. PCT International Publication WO 92/20791; 
Markland et al. PCT International Publication No. WO 92/15679; Breitling et al. PCT 
International Publication WO 93/01288; McCafferty et al. PCT International Publication 
No. WO 92/01047; Garrard etal: PCT International Publication No. WO 92/09690; Ladner 
et al. PCT International Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 
9:1370-1372; Hay et al. (1992) Hum. Aniibod. Hybridomas 3:81-85; Huse et al. (1989) 
Science 246:1275-1281; Griffiths etal. (1993) EMBOJ 12:725-734; Hawkins et al. (1992) 
J. Mol. Biol. 226:889-896; Clarkson et al. (1991) Nature 352:624-628; Gram et al. (1992) 
Proc. Natl. Acad. Sci. USA 89:3576-3580; Garrad et al. (1991) Bio/Technology 9:1373- 
1377; Hoogenboom etal. (1991) Nuc. Acid Res. 19:4133-4137; Barbas e/ al. (1991) Proc. 
Natl. Acad. Sci. USA 88:7978-7982; and McCafferty et al. Nature (1990) 348:552-554. 

Additionally, recombinant anti-8099, 46455, 54414, 53763, 67076, 671 02, 441 8 1, 
67084FL, or 67084alt antibodies, such as chimeric and humanized monoclonal antibodies, 
comprising both human and non-human portions, which can be made using standard 
recombinant DNA techniques, are within the scope of the invention. Such chimeric and 
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humanized monoclonal antibodies can be produced by recombinant DNA techniques known 
in the art; for example using methods described in Robinson et al. International Application 
No. PCT/US86/02269; Akira, et al European Patent Application 1 84, 1 87; Taniguchi, M., 
European Patent Application 171,496; Morrison et al. European Patent Application 173,494; 

5 Neuberger et al. PCT International Publication No. W,6 86/01533; Cabilly et al U.S. Patent 
No. 4,816,567; Cabilly et al. European Patent Application 125,023; Better etal. (1988) 
Science 240:1041-1043; Liu etal. (l9S7)Proc Natl. Acad Sci. USA 84:3439-3443; Liu et 
al. (1987) J. Immunol. 139 V :3521-3526; Sun et al., (1987) Proc. Natl. Acad. Sci. USA 84:214- 
218;Nishimurae/o/. (1987) Cane, Res. 47:999-1005; Wood et al. (1985) Nature 3 14:446- 

10 449; and Shaw et al. (1988) J. t Natl. Cancer Inst. 80:1553-1559); Morrison, S. L. (1985) 
, Science 229:1202-1207; Oi et al. (1986) BioTechniques 4:214; Winter U.S. Patent i 
5,225,539; Jones et al. (1986) Nature 321:552-525; Verhoeyan et al. (1988) Science 
239:1534; and Beidler et al. (1988) J. Immunol. 141:4053-4060. 
' ' An anti-8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

15 antibody (e.g., monoclonal antibody) can be used to isolate 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt by standard techniques, such as affinity 
chromatography or immunoprecipitation. An anti-8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt antibody can facilitate the purification of natural 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt from cells and of 

20 recombinantly produced 8099, 46455, 54414, 53763, 67076, 67102, 441 81 , 67084FL, or 
67084alt expressed in host cells. Moreover, an anti-8099, 46455, 54414, 53763, 67076, 
67102, 441 81, 67084FL, or 67084alt antibody can be used to detect 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides (e.g., in a cellular lysate 
or cell supernatant) in order to evaluate the abundance and pattern of expression of the 8099, 

25 46455,54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide. Anti-8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt antibodies can be used 
diagnostically to monitor polypeptide levels in tissue as part of a clinical testing procedure, 
e.g., to, for example, determine the efficacy of a given treatment regimen. Detection can be 
facilitated by coupling (i.e., physically linking) the antibody to a detectable substance. 

30 Examples of detectable substances include various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials, bioluminescent materials, and radioactive materials. 
' Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase, p*- 
galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes 
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 

35 include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 

dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
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luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 
••' 125 I, 13, I, 35 Sor 3 H. ' . 

III. Recombinant Expression Vectors and Host Cells , . 
5 Another aspect of the invention pertains to vfectors, for example recombinant 

expression vectors, containing a nucleic acid containing a 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 670$4FL, or 67084alt nucleic acid molecule or vectors containing a 
nucleic acid molecule which encodes a 8099, 46455, 54414,' 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide (or a portion thereof). As used herein, the term "vector" 

10 refers to a nucleic acid molecxile capable of transporting another nucleic acid to which it has 
' £>een linked. One type of vector is a "plasmid", which refers to a circular double stranded 1 
DNA loop into which additional DNA segments can be ligated. Another type of vector is a 
viral vector, wherein additional DNA segments can be ligated into the viral genome 
Certain vectors are capable of autonomous replication in a host cell into which they are 

1 5 introduced (e.g. , bacterial vectors having a bacterial origin of replication and episomal 

mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated 
into the genome of a host cell upon introduction into the hpst cell, and thereby are replicated 
along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively linked. Such vectors are referred to herein 

20 as "expression vectors". In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids. In the present specification, "plasmid" and 
"vector" can be used interchangeably as the plasmid is the most commonly used form of 
vector. However, the invention is intended to include such other forms of expression 
vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and 

25 adeno-associated viruses), which serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
• invention in a form suitable for expression of the nucleic acid in a host cell, which means 
that the recombinant expression vectors include one or more regulatory sequences, selected 
on the basis of the host cells to be used for expression, which is operatively linked to the 

30 nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably 
linked" is intended to mean that the nucleotide sequence of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression of the nucleotide sequence 
(e.g:, in an in vitro transcription/translation system or in a host cell when the vector is 
introduced into the host cell). The term "regulatory sequence" is intended to include 

35 promoters, enhancers and other expression control elements (e.g., polyadenylation signals). 
Such regulatory sequences are described, for example, in Goeddel; Gene Expression 
Technology: Methods in Enzymqlogy 1 85, Academic Press, San Diego, CA (1990). 
Regulatory sequences include those which direct constitutive expression of a nucleotide 
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sequence in many types of host cells and those which direct expression of the nucleotide 
sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will be . 
appreciated by those skilled iq the art that the design pf the expression vecptor can depend on 
such factors as the choice of the host cell to be transformed, the level of expression of 
5 polypeptide desired, and the like. The expression vectors of the invention, can be introduced 
into host cells to thereby produce proteins or peptides, including fusion proteins or peptides, 
encoded by nucleic acids as described herein (e.g., 8099,'46455, 54414, 53763,, 67076, 
67102, 44181, 67084FL, or 67,084alt polypeptides,' mutant forms of 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides, fusion proteins, and the 
10 like). ., 

Accordingly,' an exemplary embodiment provides a method for producing a 
polypeptide, preferably a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide, by culturing in a suitable medium a host cell of the invention (e.g., a 
mammalian host cell such as a non-human mammalian cell) containing a recombinant 

1 5 expression vector* such that the pplypeptide is produced. 

The recombinant expression vectors of the invention cah be designed for expression 
of 8099, 46455', 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides in 
prokaryotic or eukaryotic cells. For example, 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt polypeptides can be expressed in bacterial cells such as E.' 

20 coli 9 insect cells (using baculovirus expression vectors) yeast cells or mammalian cells. 

Suitable host cells are discussed further in Goeddel, Gene Expression Technology: Methods 
in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the recombinant 
expression vector can be transcribed and translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

25 Expression of proteins in prokaryotes is most often carried out in E. coli with vectors 

containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein. Such fusion vectors 
typically serve three purposes: 1) to increase expression of recombinant protein; 2) to 

30 increase the solubility of the recombinant protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion protein. Such enzymes, and their 

35 cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical 

fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B. and Johnson, 
K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
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(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (CrST), maltose E binding 
protein, or protein A, respectively, to the target recombinant protein. ' 

Purified fusion, proteims can be utilized in 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt activity assays, (e.g., direct assays or competitive 
5 assays described in detail below), or to generate antibodies specific for 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptides, for example. Ina 
preferred embodiment, a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt fusion protein expressed in a retroviral expression vector of the present invention 
can be utilized to infect bone marrow cells which are subsequently transplanted into 

10 irradiated recipients. The pathology of the subject recipient is then examined after sufficient 
time has passed (e.g., six (6) weeks). 

Examples of suitable inducible non-fusion E. coli expression vectors include pjrc 
(Amann et aL, (1988) Gene 69:301-315) and pET 1 Id (Studier et al. y Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 

15 60-89). Target gene expression, from the pTrc vector relies on host RNA polymerase 

transcription from a hybrid trp-lac fusion promoter. Target gene expression'from the pET 
lid vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a 
coexpressed viral RNA polymerase (T7 gnl) ? This viral polymerase is supplied by host 
strains BL21(DE3) or HMS174(DE3) from a resident prophage harboring a T7 gnl gene 

20 under the transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein expression in E. coli is to express the 
protein in a host bacteria with an impaired capacity to proteolytioally cleave the recombinant 
protein (Gottesman, S., Gene Expression Technology: Methods in Enzymology 1 85, 
Academic Press, San Diego, California (1990) 119-128). Another strategy is to alter the 

25 nucleic acid sequence of the nucleic acid to be inserted into an expression vector so that the 
individual codons for each amino acid are those preferentially utilized in E. coli (Wada et 
aL 9 (1992) Nucleic Acids Res. 20:2111^2118). Such alteration of nucleic acid sequences of 
the invention can be carried out by standard DNA synthesis techniques. 

- In another embodiment, the 8099', 46455, 54414, 53763, 67076, 67102, 44181, 

30 67084FL, or 67084alt expression vector is a yeast expression vector. Examples of vectors 
for expression in yeast S. cerevisiae include pYepSecl (Baldari, et a/., (1987) Embo J. 
6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et 
al 9 (1987) Gene 54:113-123), pYES2 (Invitrogen Corporation, San Diego, CA), and picZ 
(InVitrogen Corp, San Diego, C A). 

35 Alternatively, 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

67084alt polypeptides can be expressed in insect cells using baculovirus expression vectors. 
Baculo virus vectors available for expression of proteins in cultured insect cells (e.g. , Sf 9 
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cells) include the pAc series (Smiihetal (1983) Mol Cell Biol. 3 :2 156-2 165) and the pVL 
series (Lucklow and Summers (1 989) Virology 1 70:3 1-39). 

In yet another embodiment, a .nucleic acid qf the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples; of mammalian 

5 expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC 

(Kaufman et al (1987) EMBO J. 6:187-195). When used in mammalian cells, the expression 
vector's control functions are often provided by viral regulatory elements. For example, 
commonly used promoters are derived from polyoma, Adenovirus 2, cytomegalovirus and 
Simian Virus 40, For other suitable expression systems for both prokaryotic arid eukaryotic 

10 cells see chapters 16 and 17 of Sambrook, J., Fritsh, E. p., and Maniatis, T. Molecular 
i Cloning: A Laboratory Manual 2nd, ed t Cold Spring Harbor Laboratory, Cold Spring • 
Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

In another embodiment, the recombinant mammalian expression, vector is capable of 
' directing expression of the nucleic acid preferentially in a particular cell type (e.g., tissue- . 

15 specific regulatory elements are used to express the nucleic acid). Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of suitable tissue-specific promoters 
include the albumin promoter (liver-specific; Pinkert et al (1987) Genes Dev. 1 :268-277), 
lymphoid-specific promoters (Calame and Eaton (1988) Adv. Immunol. 43:235-275), in 
particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J. 8:729-733) 

20 and immunoglobulins (Banerji et al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33:741-748), neuron-specific promoters (e.g., the neurofilament promoter; Byrne and 
Ruddle (1989) Proc. Natl Acad. Set USA 86:5473-5477), pancreas-specific promoters 
(Edlund et al (1985) Science 230:912-916), and mammary gland-specific promoters (e.g., 
milk whey promoter; U.S. Patent No. 4,873 ,3 16 and European Application Publication No. 

25 264,166). Developmentally-regulated promoters are also encompassed, for example the 

murine hox promoters (Kessel and Gruss (1990) Science 249:374-379) and the a-fetoprotein 
. promoter (Campes and Tilghman ( 1989) Genes Dev. 3 :537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of the invention cloned into the expression vector in an antisense orientation. That 

30 is, the DNA molecule is operatively linked to a regulatory sequence in a manner which 

allows for expression (by transcription of the DNA molecule) of an RNA molecule which is 
' antisense to 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the antisense 
orientation can be chosen which direct the continuous expression of the antisense RNA 

35 molecule in a variety of cell types, for instance viral promoters and/or enhancers, or 

regulatory sequences can be chosen which direct constitutive, tissue specific or cell type 
specific expression of antisense RNA. The antisense expression vector can be in the form of 
a recombinant plasmid, phagemid or attenuated virus in which antisense nucleic acids are 
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produced under the control of a high efficiency regulatory region, the activity of which can 
be determined by the cell type into which the vector is introduced. For a discussion of the 
regulation of gene expression using antrsense genes see Weintraub, H. et al,, Antisense RNA 
as a molecular tool for genetic analysis, Reviews -Tren^inGenetics,Vo\.\{\)m6: 
Another aspect of the invention pertains to host cells into which a 8099 46455 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleic acid molecule of die 
invention is introduced, e.g., a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL 
or 67084alt nucleic acid molecule within a vector (e.g., a recombinant expression vector) or 
a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleic acid 
molecule containing sequences which allow it to homologously recombine into a specific 
sue of the host cell's genome. The terms "host cell" and "recombinant host cell" are used ' 
interchangeably-herein. It is understood that such terms refer not only to the particular 
subject cell but to the progeny or potential progeny of such a cell. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still 
included within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, a 8099 46455 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084ak polypeptide can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese 
hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those 
skilled in the art. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 
transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g. , DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or 
electroporation. Suitable methods for transforming or transacting host cell's can be found in 
Sambrook, et al. (Molecular Cloning: A La boratory Manual. 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989), and 
30 other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these integrants 
a gene that encodes a selectable marker (e.g., resistance to antibiotics) is generally 
35 introduced into the host cells along with the gene of interest. Preferred selectable markers 
include those which confer resistance to drugs, such as G41 8, hygromycin and methotrexate 
Nucleic acid encoding a selectable marker can be introduced into a host cell on the same 
vector as that encoding a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL or 
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67084alt polypeptide or can be introduced on a separate vector. Cells stably transfected 
with the introduced nucleic 'acid can be identified.by drug selectidn (e.g., cells that have 
incorporated the selectable marker gene will survive, while the other cells,die). . 

A host cell of the invention, such as-a prokaryotic or eukaryotic host cell in culture, 
5 can be used to produce (Le., express) a 8099, 4645.5, 54414, 53763, 67076,67102, 44181, 
67084FL, or 67084alt polypeptide. Accordingly, the invention further provides methods 
for producing a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide using the host cells of the invention. In one embodiment, the method comprises 
culturing the host cell of the invention (into which a recombinant expression vector 
10 encoding a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide has been introduced) in a suitable medium such that a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide is produced. In another 
embodiment, the method further comprises isolating a 8099, 46455, 544 14, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide from the medium or the host cell. 
15 The host cells of the invention can also be used to produce non-human transgenic . 

animals. For example, in one embodiment, a host cell of the invention is a fertilized oocyte 
or an embryonic stem tell into which 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt -coding sequences have been introduced. Such host cells can then be 
used to create non-human transgenic animals in which exogenous 8099, 46455, 54414, 
20 53763, 67076, 67102, 44181, 67084FL, or 67084alt sequences have been introduced into 
their genome or homologous recombinant animals in which endogenous 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt sequences have been altered. 
Such animals are useful for studying the function and/or activity of a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt and for identifying and/or evaluating 
25 modulators of 8099, 46455, 54414, 53763, 67076, 67102', 44181, 67084FL, or 67084alt 
activity. As used herein, a "transgenic animal" is a non-human animal, preferably a 
mammal, more preferably a rodent such as a rat or mouse, in which one or more of the cells 
of the animal includes a transgene. Other examples of transgenic animals include non- 
human primates, sheep, dogs, cows, goats, chickens, amphibians, and the like. A transgene 
30 is exogenous DNA which is integrated into the genome of a cell from which a transgenic 
animal develops and which remains in the genome of the mature animal, thereby directing 
the expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, a "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in which an endogenous 8099, 
35 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene has been altered by 
homologous recombination between the endogenous gene and an exogenous DNA molecule 
introduced into a cell of the animal, e.g. , an embryonic cell of the animal, prior to 
development of the animal. 
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A transgenic animal of the invention can be created by introducing a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt -encoding nucleic acid into the . 
male pronuclei of a fertilized oocyte, e.g. , by microinjection, retroviral infection, and 
allowing me oocyte to develop in a pseudopregnant female foster animal. The 8099, 46455, 
5 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt cDNA sequence of SEQ ID 
NO.l, SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO:10, or SEQ ID NO:13 can be introduced 
as a transgene into the genome of a non-human animal; Alternatively, a nonhuman 
homolpgue of a human 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt gene, such as a mouse or rat 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

10 67084FL, or 67084alt gene, can be used as a transgene. Alternatively, a 8099, 46455, 
. 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene homologue, such as' • 
another 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt family 
member, can be isolated based on hybridization to the 8099, 46455, 54414, 53763, 67076, 
'67102, 44181, 67084FL, or 67084alt cDNA sequences of SEQ ID NO: 1, SEQ ID NO:3, 

15 SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO.10, SEQ ID '. 
N0.12, SEQ ID NO: 13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:19, 
SEQ ID NO:21 , SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or the ' 

DNA insert of the plasmid deposited with ATCC as Accession Number 

— — _» » or (described further in subsection I above) and used as a 

20 transgene. Intronic sequences and pblyadenylation signals can also be included in the 
transgene to increase the efficiency of expression of the transgene. A tissue-specific 
regulatory sequence(s) can be operably linked to a 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt transgene to direct expression of a 8099, 4645 5, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide to particular cells. 

25 Methods for generating transgenic animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become conventional in the art and are described, for 
. example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et aL, U.S. Patent No. 
4,873,191 by Wagner et al. and in Hogan, B., Manipulating the Mouse Embryo, (Cold 

L- Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1 986). Similar methods are 

30 used for production of other transgenic animals. A transgenic founder animal can be 

identified based upon the presence of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt transgene in its genome and/or expression of 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals carrying 

35 the transgene. Moreover, transgenic animals carrying a transgene encoding a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide can further be bred 
to other transgenic animals carrying other transgenes. 
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To create a homologous recombinant animal, a vector is prepared which contains at 
least a portion of a 8099, 4a55;'54414, 53763, 67076, 67102, 44l81, 67084FL, or 67084alt 
gene into which a deletion, addition or substitution has been introduced ,to thereby alter, e.g., 
functionally disrupt, the 8099, 46455, 54414, 53763, 67076, 67102, 441 81 , 67084FL, or 

5 67084alt gene. The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

67084alt gene can be a human gene (e.g., the cDNA of SEQ ID NO:3, SEQ ID NO:6, SEQ 
ID NO:9, SEQ ID NO: 1 2, or SEQ ID NO: 1 5), but more preferably, is a non-human 
homologue of a human 8099, 46455, 54414, 53763^ 67076, 67102, 44181, 67084FL; or; 
67084alt gene (e.g. , a cDNA isolated by stringent hybridization with the nucleotide 

10 sequence of SEQ ID NO:l, SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO: 10, or SEQ ID 
NO:13). For example^ mouse 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL; or 67084alt gene can be used to. construct a homologous recombination nucleic 
acid molecule, e.g., a vector, suitable for altering, an endogenous 8099* 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt gene in the mouse genome. In a 

15 preferred embodiment, the homologous recombination nucleic acid molecule is designed 
such that, upon homologous recombination, the endogenous 8()99, 46455, 54414, 53763, 
67076, 67102, 441 81, 67084FL, or 67084alt gene is functionally disrupted (i.e. , no longer 
encodes a functional protein; also referred to as a "knock out" vector). Alternatively, the 
homologous recombination nucleic acid molecule can be designed such that, upon 

20 homologous recombination, the endogenous 8099, 46455, 54414, 53763, 67076, 67102, 
441 8 1 , 67084FL, or 67084alt gene is mutated or otherwise altered but still encodes 
functional polypeptide (e.g., the upstream regulatory region can be altered to thereby alter 
the expression of the endogenous 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084ait polypeptide). In the homologous recombination nucleic acid 

25 molecule, the altered portion of the 8099, 46455, 54414, 53763, 67076, 67102, 441 81 , 
67084FL, or 67084alt gene is flanked at its 5' and 3' ends by additional nucleic acid 
sequence of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
gene to allow for homologous recombination to occur between the exogenous 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene carried by the homologous 

30 recombination nucleic acid molecule and an endogenous 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt gene in a cell, e.g., an embryonic stem cell. The 
additional flanking 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
nucleic acid sequence is of sufficient length for successful homologous recombination with 
the endogenous gene. Typically, several kilobases of flanking DNA (both at the 5 1 and 3* 

35 ends) are included in the homologous recombination nucleic acid molecule (see, e.g. , 
Thomas, K.R. and Capecchi, M. R. (1 987) Cell 51:503 for a description of homologous 
recombination vectors). The homologous recombination nucleic acid molecule is 
introduced into a cell, e.g. , an embryonic stem cell line (e.g. , by electroporation) and cells in 
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which the introduced 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt gene has homologous/ly recombiried with the endogenous 8099,' 46455, 54414, 
53763, 67076, 67102, 441 81, .67084FL, or 67084alt gene are selected (see e.g., Li, E. etal. 
(1992) Cell 69:915). The selected cells caii then injected into a blastocyst of an animal (e.g. 
5 a mouse) to form aggregation chimeras (see e.g., Bradley, A. in Teratocarcinomds and 
Embryonic Stem Cells: A Practical Approach, E.J. Robertson, ed. (IRL, Oxford, 1987) pp. 
113-152). A chimeric embryo can then be implanted intb a suitable pseudopregnant female 
foster animal and the embryo brought td term. Progeny harboring the homologousl'y > 
recombined DNA in their germ cells can be used to breed animals in which all cells of the 

10 animal contain the homologously recombined DNA by germline transmission of the 

transgene. Methods for constructing homologous recombination nucleic acid molecules, 
e.g. , vectors, or homologous recombinant animals are described further in Bradley, A. 
(1991) Current Opinion in Biotechnology 2:823t829 and in PCT International Publication 
Nos.: WO 90/1 1 354 by Le Mouellec et al. ; WO 91/01 140 by Smithies et al. ; WO 92/0968 

15 by Zijlstra et al. ; and WO 93/04169 by Bems et al. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI . For a 
description of the cre/loxP recombinase system, see, e.g. , Lakso et al (1 992) Proa Natl. 

20 Acad Set USA 89:6232-6236. Another example of a recombinase system is the FLP 
recombinase system of Saccharomyces cerevisiae (O'Gorman et al (1991) Science 
251:1351-1355. If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a selected 
protein are required. Such animals can be provided through the construction of "double" 

25 transgenic animals, e.g., by mating two transgenic animals/one containing a transgene 
encoding a selected protein and the other containing a transgene encoding a recombinase. 

* Clones of the non-human transgenic animals described herein can also be produced 

according to the methods described in Wilmut, I. et al (1997) Nature 385:810-813 and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a ceil, e.g., a 

30 somatic cell, from the transgenic animal can be isolated and induced to exit the growth cycle 
and enter G G phase. The quiescent cell can then be fused, e.g., through the use of electrical 

pulses, to an enucleated oocyte from an animal of the same species from which the quiescent 
cell is isolated. The reconstructed oocyte is then cultured such that it develops to morula or 
blastocyte and then transferred to pseudopregnant female foster animal. The offspring borne 
35 of this female foster animal will be a clone of the animal from which the cell, e.g., the 
somatic cell, is isolated. 
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IV. Pharmaceutical Compositions 

The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
nucleic acid molecules, fragments of 8099, 46455, £4414, 53763, 67076, 67102, 44181, . 
67084FL, or 67084alt polypeptides, anti-8099, 46455, 54414, 53763, 67076, 67102, 44181, 

5 67084FL, or 67084alt antibodies, and or 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt modulators, (also referred to herein as "active compounds' 1 ) of the 
invention can be incorporated into pharmaceutical compositions .suitable for administration. 
Such compositions typically comprise the nuclei*? acid molecule, polypeptide, or antibody 
and a pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 

10 acceptable carrier" is intended, to include any and all solvents, dispersion media, coatings, 
, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, i 
compatible with .pharmaceutical administration. The use of such media and agents for 
pharmaceutically active substances is well known in the art. Except insofar as any 
' conventional media or agent is incompatible with the active compound, use thereof in the 

1 5 compositions is contemplated. Supplementary active compounds can also be incorporated 
into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible with 
its intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), transdermal 

20 (topical), transmucosal, and rectal administration. Solutions or suspensions used for 

parenteral, intradermal, or subcutaneous application can include the following components: 
a sterile diluent such as water for injection, saline solution, fixed oils, polyethylene glycols, 
glycerine, propylene glycol or other synthetic solvents; antibacterial agents such as benzyl 
alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; chelating 

25 ajgents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or 

phosphates and agents for the adjustment of tonicity such as sodium chloride or dextrose. 
pH can be adjusted with acids or bases, such as hydrochloric acid or sodium hydroxide. The 
parenteral preparation can be enclosed in ampoules, disposable syringes or multiple dose 
vials made of glass or plastic. 

30 Pharmaceutical compositions suitable for injectable use include sterile aqueous 

solutions (where water soluble) or dispersions and sterile powders for the extemporaneous 
' preparation of sterile injectable solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EL™ (BASF, 
Parsippany, NJ) or phosphate buffered saline (PBS). In all cases, the composition must be 

35 sterile and should be fluid to the extent that easy syringability exists. It must be stable under 
the conditions of manufacture and storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The carrier can be a solvent or 
dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
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propylene glycol, and liquid polyetheylene glycol, and the like), and suitable mixtures 
thereof. The proper fluidity can be maintained, for example, by the use of a coating such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and by the 
use of surfactants. Prevention of the acti on of microorganisms can .be achieved by various 
5 antibacterial and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic 
acid, thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
for example, sugars, polyalcohols such as manitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about by 
. including in the composition an agent which delays absorption, for example, aluminum 
10 monostearate and gelatin. ' 

' ; Sterile injectable solutions can be prepared by incorporating the active compound ' 
(e.g., a fragment of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide or an anti-8099, 46455, 54414, 53763, 67076, 67102, 441 81, 
67084FL, or 67084alt antibody) in the required amount in an appropriate solvent with one 
15 or a combination of ingredients enumerated above, as required, followed by filtered 

sterilization. Generally, dispersions are prepared by incorporating the active compound into 
a sterile vehicle which contains a basic dispersion mediumand the required other ingredients 
from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and freeze- 
drying which yields a powder of the active ingredient plus any additional desired ingredient 
from a previously sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can 
be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. Pharmaceutical^ compatible 
binding agents, and/or adjuvant materials can be included as part of the composition . The 
tablets, pills, capsules, troches and the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as microcrystalline cellulose, gum tragacanth 
or gelatin; an excipient such as starch or lactose, a disintegrating agent such as alginic acid, 
Primogel, or com starch; a lubricant such as magnesium stearate or Sterotes; a glidant such 
as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring 
agent such as peppermint, methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable propellant, 
e.g. , a gas such as carbon dioxide, or a nebulizer. 
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Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration; penetrants appropriate to the barrier to be 
permeated are used in the fonpulation. Such penetrants are. generally k^Qwn in, the art, and 
include, for, example, for transmucosal administration, detergents, bile salts, and fusidic acid 
5 derivatives. Transmucosal administration can be accomplished through the use of nasal 
sprays or suppositories. For transdermal administration, the active compounds are 
formulated into ointments, salves, gels, or creams as generally known in the art. 

The compounds can alpo be prepared in the form of suppositories {e.g. , with ' . 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
10 enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will protect 
the compound against rapid elimination from the body, such as a controlled release 
formulation, including implants and microencapsulated delivery systems. Biodegradable, 
biocompatible polymers can be used, such as, ethylene vinyl acetate, polyanhydrides, 
15 polyglycolic acid; collagen, polyorthoesters, and polylactic acid. Methods for preparation of 
such formulations will be apparent to those skilled in the art. The materialman also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. Liposomal 
suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be 
20 prepared according to methods known to those skilled in the art, for example, as described in 
U.S. Patent No. 4,522,811. ' ' 

It is especially advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 
25 treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the particular 
therapeutic effect to be achieved, and the limitations inherent in the art of compounding such 
30 an active compound for the treatment of individuals. 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the 
LD50 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is 
35 the therapeutic index and it can be expressed as the ratio LD50/ED50. Compounds which 
exhibit large therapeutic indices are preferred. While compounds that exhibit toxic side 
effects may be used, care should be taken to design a delivery system that targets such 
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compounds to the site of affected tissue in order to minimize potential damage to uninfected 
cells and, thereby, reduce side effects. ' ' ' 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED50 with little or 
no toxicity. The dosage may vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any compound used in the method of 
the invention, the therapeutically effective dose can be estimated initially from cell culture 
assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC50 (/.*, the concentration of the test compound 
which achieves a half-maximal inhibition of symptoms) as determined in cell culture Such 
information can be used to more accurately determine useful doses in humans Levels in 
plasma may be measured, for example, by high performance liqujd chromatography. 
As defined herein, a therapeutically effective amount of polypeptide (i e an 
15 effective dosage) ranges from about 0.001 to 30 mg/kg body weight, preferably about 0 01 
to 25 mg/kg body weight, more preferably about 0.1 to 20 mg/kg body weight, and even 
more preferably about 1 to 10 mg/kg, 2 >o 9 mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6 
mg/kg body weight. The skilled artisan will appreciate that certain factors may influence the 
dosage required to effectively treat a subject, including but not limited to the severity of the 
disease or disorder, previous treatments, the general health and/or age of the subject and 
other diseases present. Moreover, treatment of a subject with a therapeutically effective 
amount of a polypeptide or antibody can include a single treatment or, preferably, can 
include a series of treatments. 

In a preferred example, a subject is treated with antibody or polypeptide in the range 
25 of between about 0.1 to 20 mg/kg body weight, one time per week for between about 1 to 10 
weeks, preferably between 2 to 8 weeks, more preferably between about 3 to 7 weeks and 
even more preferably for about 4, 5, or 6 weeks. It will also be appreciated that the effective 
dosage of antibody or polypeptide used for treatment may increase or decrease over the 
course of a particular treatment. Changes in dosage may result and become apparent from 
30 the results of diagnostic assays as described herein. 

The present invention encompasses agents Which modulate expression or activity 
An agent may, for example, be a small molecule. For example, such small molecules 
include, but are not limited to, peptides, peptidomimetics, amino acids, amino acid analogs 

polynucleotides, polynucleotide analogs, nucleotides, nucleotide analogs, organic or 
35 inorganic compounds (/.*.,. including heteroorganic and organometallic compounds) having 
a molecular weight less than about 1 0,000 grams per mole, organic or inorganic compounds 
having a molecular weight less than about 5,000 grams per mole, organic or inorganic 
compounds having a molecular weight less than about 1,000 grams per mole, organic or 
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inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutical^ acceptable forms of such compounds. It is 
understood that appropriate doses of small molecule agents depends upon a number of 
factors within the ken of the ordinarily skilled physician, veterinarian, or researcher. The 
5 dose(s) of the small molecule will vary, for example, depending upon the identity, size, and 
condition of the subject or sample being treated, further depending upon the route by which 
the composition is to be administered, if applicable, and the effect which the practitioner 
desires the small molecule to have upon the nucleic acid or polypeptide of the invention. 

Exemplary doses include milligram or microgram amounts of die small molecule per 
l o kilogram of subject or sample weight (e.g. , about 1 microgram per kilogram to about 500 
■ milligrams per kilogram, about 1 00 micrograms per kilogram to about 5 milligrams per ' 
kilogram, or about 1 microgram per kilogram to about 50 micrograms per kilogram. It is 
furthermore understood that appropriate doses of a small molecule depend upon the potency 
' ' of the small molecule with respect to the expression or activity to be modulated. Such 
15 appropriate doses may be determined using the assays described herein. When one or more 
of these small molecules is to be administered to an animal (e.g. , a human) in order to 
modulate expression or activity of a polypeptide or nucleic acid of the invention, a 
physician, veterinarian, or researcher may, for example, prescribe a relatively low dose at 
first, subsequently increasing the dose until an appropriate response is obtained. In addition, 
20 it is understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific compound employed, the age, 
body weight, general health, gender, and diet of the subject, the time of administration, the 
route of administration, the rate of excretion, any drug combination, and the degree of 
expression or activity to be modulated. 
25 ' Further, an antibody (or fragment thereof) may be conjugated to a therapeutic moiety • 
such as a cytotoxin, a therapeutic agent or a radioactive, metal ion. A cytotoxin or cytotoxic 
agent includes any agent that is detrimental to cells. Examples include taxol, cytochalasin 
B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, 
vinblastine, col chicin, doxorubicin, daun'orubicin, dihydroxy anthracin dipne, mitoxantrone, 
30 mithramycin, actinomycin D, 1 -dehydrotestosterone, glucocorticoids, procaine, tetracaine, 
lidocaine, propranolol, and puromycin and analogs or homologues thereof. Therapeutic 
' agents include, but are not limited to, antimetabolites (e.g., methotrexate, 6-mercaptopurine, 
6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents (e.g. , 
mechlorethamine, thioepa chlorambucil, melphalan, carmustine (BSNU) and lomustine 
35 (CCNU), cyclothosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and 
cis-dichlorodiamine platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin 
(formerly daunomycin) and doxorubicin), antibiotics (e.g., dactinomycin (formerly 
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actinomycin), bleomycin, mithxamycin, and anthramycin (AMC)), and anti-mitotic agents 
(e.g., vincristine and Vinblastine). ' ■ • 

The conjugates of the.invention can be used for modifying a gi ven biological 
response, the drug moiety is not to be construed as limited to classical chemical therapeutic 
agents. For example, the drug moiety may be aprotein or polypeptide possessing a desired 
biological activity. Such proteins may include, for example, a toxin such as abrin ricin A 
pseudomonas exotoxin, or diphtheria toxin; a protein sucla as tumor necrosis factor alpha- 
mterferon, beta-interferon, nenve growtH factor, platelet derived growth factor tissue • 
plasminogen activator; or, biological response modifiers such as, for example,' lympholdnes 
interleukhW ("IL-1 "),interleukin-2 ("IL-2"), interleukin-6 ("11,6"), granulocyte 
macrophage colony stimulating factor ("GM-CSF"), granulocyte colony stimulating factor 
("G-CSF"), or other growth factors. , 

Techniques for conjugating such therapeutic moiety to antibodies are well known 
see, e.g., Arnon et al, "Monoclonal Antibodies For Immunotargeting Of Drugs In Cancer 
Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfelde, al (eds) p P 243-56 
(Alan R. Liss, Inc. 1985); Hellstrom et al, "Antibodies For Drug Delivery",' in Controlled 
Drug Delivery (2nd Ed.), Robinson et al (eds.), pp. 623-53 (Marcel Dekker, Inc 1987)- 
Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer Therapy: A Review" in 
Monoclonal Antibodies '84: Biological And Clinical Applications, Pinchera et al (eds ) pp 
475-506 (1985); "Analysis, Results, And Future Prospective Of The Therapeutic Use Of 
Radiolabeled Antibody In Cancer Therapy", in Monoclonal Antibodies For Cancer 
Detection And Therapy, Baldwin et al (eds.), pp. 303-16 (Academic Press 1985) and 
Thorpe etal, "The Preparation And Cytotoxic Properties Of Antibody-Toxin Conjugates" 
Immunol. Rev., 62:1 19-58 (1982). Alternatively, an antibody can be conjugated to a second 
antibody to form an antibody heterocpnjugate as described by Segal in U S Patent No 
4,676,980. 

The nucleic acid molecules of the invention can be inserted into vectors and used as 
gene therapy vectors. Gene therapy vectors can be delivered to a subject by, for example 
intravenous injection, local administration (see U.S. Patent 5,328,470) or by stereotactic ' 
injection (see e.g., Chen et al (1994) Proc. Natl Acad. Sci. USA 91 :3054-3057) The 
pharmaceutical preparation of the gene therapy vector can include the gene therapy vector in 
an acceptable diluent, or can comprise a slow release matrix in which the gene delivery 
vehicle is imbedded. Alternatively, where the complete gene delivery vector can be 
produced intact from recombinant cells, e.g., retroviral vectors, the pharmaceutical 
preparation can include one or more cells which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. 
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V. Uses and Methods of the Invention 

~~ The nucleic acid molecules, proteins, protein homologues, antibodies, and 
modulators.described herein can be used in one or more of the following methods: a) 
screening assays; b) predictive medicine (e.g., diagnostic assays, prognostic assays, 
5 monitoring clinical trials, and pharmacogenetics); and c) methods of treatment (e.g. , 
therapeutic and prophylactic). 

As described herein, an 8099 and/or 46455 polypeptide of the invention has one or 
more of the following activities: (1) bind a monosaccharide,' e.g. , D-glucose; D-fructose, D- 
galactose, and/or mannose, (2) transport monosaccharides across a cell membrane, (3) 
10 influence insulin and/or glucagon secretion, (4) maintain sugar homeostasis in a cell, (5) 
, function as a neuronal transporter, and (6) mediate trans-epithelial movement in a cell. • 
As described herein, a 54414 and/or 53763 protein of the invention has one or more 
of the following activities: (i) interaction with a 54414 or 53763 substrate (e.g., a potassium 
' ' ion or a cyclic nucleotide); (ii) conductance or transport of a 54414 or 53763 substrate 
15 across a cellular membrane; (iii) interaction with a second non-54414 or 53763 protein (e.g., 
a 54414 or 53763 polypeptide or a non-54414 or 53763 potassium channel subunit); (iv) 
modulation (e.g., maintenance and/or rectification) of membrane potentials; (v) regulation of 
target molecule availability or activity; (vi) modulation of intra- or intercellular signaling 
and/or gene transcription (e.g., either directly or indirectly); (viii) generation of outwardly 
20 rectifying currents; (viii) modulation of membrane excitability; (ix) modulation of the 
release of neurotransmitters; (x) regulation of contractility (e.g., of smooth muscle cells), 
secretion, and/or synaptic transmission; and/or (xi) modulation of processes which underlie 

learning and memory. 

Preferred activities of 54414 further include at least one of the following activities: 
25 (i) interaction with maxi-K potassium channels (i.e., large conductance channels, in 

particular Slo); (ii) modulation of maxi-K potassium channel activity (e.g., S/o-mediated 
activities); (iii) generation of intermediate conductance channels; and/or (iv) regulation of 
contractility (e.g., of smooth muscle cells), secretion, and/or synaptic transmission, in 
particular, via modulation of Slo. 
30 Preferred activities of 53763 further include at least one of the following activities: 

(i) interaction with Shaker (Sh) potassium channels and/or channel subunits; (ii) modulation 
' of Shaker (Sh) potassium channel activity (e.g., termination of prolonged membrane 
depolarization; (iii) modulation of high voltage activating channel activity and/or 
inactivating channel activity, and the like. 
35 As described herein, a 67076, 67102, 44181, 67084FL, or 67084alt polypeptide of 

the invention has one or more of the following activities: (i) interaction with a 67076, 
67102, 44181, 67084FL, or 67084alt substrate or target molecule (e.g., a phospholipid, 
ATP, or a non-67076, 67102, 44181, 67084FL, or 67084alt protein); (ii) transport of a 



85 



BNSDOCID: <WO_02055701 A2_l_> 



WO 02/055701 PCT/US01/49060 

67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt substrate or target molecule (e.g. , an 
aminophospholipid such as phosphatidylserine or phosphatidylethanolamine) from one side 
of a cellular membrane to the other; (iii) the ability- to be phosphorylated or 
dephdsphorylated; (iv) adoption of an El conformation or an E2 conformation; (v) 
5 conversion ofa 67076, 67102, 44181, 67084FL, or 67084alt substrate or target molecule to 
a product (e.g., hydrolysis of ATP); (vi) interaction with a second non- 67076, 67102, 
44181, 67084FL, or 67084alt protein; (vii) modulation of substrate or target molecule 
location (e.g., modulation of phospholipid location within a bell and/or location with respect 
to a cellular membrane); (viii) maintenance of aminophospholipid gradients; (ix) modulation 
10 of intra- or intercellular signaling and/or gene transcription (e.g. ; either directly or 
1 indirectly); and/or (x) modulation of cellular proliferation, growth, differentiation, * 
apoptosis, absorption, or secretion. 

The isolated nucleic acid molecules of the invention can be usedj for example, to 
_ express 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

1 5 polypeptides (e.g. , via a recombinant expression vector in a host cell in gene therapy 
applications), to detect 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt mRNA (e.g., in a biological sample) or a genetic alteration in a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene, and to modulate 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt activity, as described 
20 further below. The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptides can be used to treat disorders characterized by insufficient or 
excessive production of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt substrate or production or transport of 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt inhibitors, for example, 8099, 46455, 54414, 53763, 67076, 
25 67102, 44181, 67084FL, or 67084alt associated disorders. 

As used herein, a "sugar transporter" includes at protein or polypeptide which is 
• involved in transporting a molecule, e.g., a monosaccharide such as D-glucose, D-fructose, 
■ ■ D-galactose or mannose, across the plasma membrane of a cell, e.g. , a liver cell, fat cell, 
muscle cell, or blood cell, such as an erythrocyte. Sugar transporters regelate sugar 
30 homeostasis in a cell and, typically, have sugar substrate specificity. Examples of sugar 
transporters include glucose transporters, fructose transporters, and galactose transporters. 

As used herein, a "sugar transporter mediated activity " includes an activity which 
involves a sugar transporter, e.g., a sugar transporter in a liver cell, fat cell, muscle cell, or 
blood cell, such as an erythrocyte. Sugar transporter mediated activities include the 
35 transport of sugars, e.g. , D-glucose, D-fructose, D-galactose or mannose, into and out of 
cells; the stimulation of molecules that regulate glucose homeostasis (e.g., insulin and 
glucagon), from cells, e.g., pancreatic cells; and the participation in signal transduction 
pathways associated with sugar metabolism. 
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Asthe 8099 and 46455 molecules of the present invention are sugar transporters, 
they may be useful for developing novel diagnostic and therapeutic agents for sugar . 
transporter associated disorders. As used herein, the terms "sugar transporter associated 
disorder" apd "8099 and 46455 disorder," Used interchangeably herein, includes a disorder, 

5 disease, or condition which is characterized by an aberrant, e.g., upregulated or 

downregulated, sugar transporter mediated activity. Sugar transporter associated disorders 
typically result in, e.g., upregulated or downregulated, sxigar levels in a cell. Examples of 
sugar transporter associated disorders infclude disorders associated with sugar homedstasis, 
such as obesity, anorexia, type-1 diabetes, type-2 diabetes, hypoglycemia, glycogen storage 

10 disease (Von Gierke disease), type I glycogenosis, bipolar disorder, seasonal affective 
disorder, and cluster 6 personality disorders. 

As used interchangeably herein, a "potassium channel associated disorder" or a 
"54414 or 53763 associated disorder" include a disorder, disease or condition which is 
caused or characterized by a misregulation (e.g., downregulation or upregulation) of 54414 

15 or 53763 activity. 54414 or 53763 associated disorders can detrimentally affect cellular 
functions such as cellular proliferation, growth, differentiation,' inter- or intracellular 

communication; tissue function, such as cardiac function or musculoskeletal function; 

■ • ■ 

systemic responses in an organism, such as nervous system responses, hormonal responses 
(e.g., insulin response), or immune responses; and protection of cells from toxic compounds 

20 (e.g., carcinogens, toxins^ or mutagens). 

In a preferred embodiment, 54414 or 53763 associated disorders include CNS 
disorders such as cognitive and neurodegenerative disorders, examples of which include, but 
are not limited to, Alzheimer's disease, dementias related to Alzheimer's disease (such as 
Pick's disease), Parkinson's and other Lewy diffuse body diseases, senile dementia, 

25 Huntington's disease, Gilles de la Tourette's syndrome, multiple sclerosis, amyotrophic 
lateral sclerosis, progressive supranuclear palsy, epilepsy, seizure disorders, and Jakob- 
Creutzfieldt disease; autonomic function disorders such as hypertension and sleep disorders, 
and neuropsychiatric disorders, such as depression, schizophrenia, schizoaffective disorder, 
korsakoffs psychosis, mania, anxiety disorders, or phobic disorders; learning or memory 

30 disorders, e.g., amnesia or age-related memory loss, attention deficit disorder, dysthymic 
disorder, major depressive disorder, mania, obsessive-compulsive disorder, psychoactive 
substance use disorders, anxiety, phobias, panic disorder, as well as bipolar affective 
disorder, e.g., severe bipolar affective (mood) disorder (BP-1), and bipolar affective 
neurological disorders, e.g., migraine and obesity. Further CNS-related disorders include, 

35 for example, those listed in the American Psychiatric Association's Diagnostic and 
Statistical manual of Mental Disorders (DSM), the most current version of which is 
incorporated herein by reference in its entirety. 
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54414 or 53763 associated disorders also include cellular proliferation, growth, 
differentiation, or apoptosis'disorders. Cellularproliferation, growth, differentiation, or 
apoptosis disorders include those disorders that affect cell proliferation,, growth^ 
differentiation, or apoptosis processes. As used herein, a "cellular proliferation,' growth, 
differentiation, or apoptosis process" is a process by which a cell increases in number, size 
or content, by which a cell develops a specialized set of characteristics which differ from 
that of other cells, or by which a cell undergoes programmed cell death. The 54414 or 
53763 molecules of the present invention may modulate cellular growth, proliferation, • 
differentiation, or apoptosis, and may play a role in disorders characterized by aberrantly 
regulated growth, proliferation, differentiation, or apoptosis. Such disorders include cancer, 
e.g., carcinoma, sarcoma, or leukemia; tumor angiogenesis and metastasis; skeletal 
dysplasia; hepatic disorders; and hematopoietic and/or myeloproliferative disorders. 

Further examples of 54414 or 53763 associated disorders include cardiac-related 
disorders. Cardiovascular system disorders in which the 54414 or 53763 molecules of the 
15 invention may be directly or indirectly involved include arteriosclerosis, ischemia 

reperfusion injury, restenosis, arterial inflammation, vascular wall remodeling, ventricular 
remodeling, rapid ventricular pacing, coronary microembolism; tachycardia, bradycardia, 
pressure overload, aortic bending, coronary artery ligation, vascular heart disease, atrial ' 
fibrilation, Jervell syndrome, Lange-Nielsen syndrome, long-QT syndrome, congestive heart 
failure, sinus node dysfunction, angina, heart failure, hypertension, atrial fibrillation, atrial 
flutter, dilated cardiomyopathy, idiopathic cardiomyopathy, myocardial infarction, coronary 
artery disease, coronary artery spasm, and arrhythmia. 54414 or 53763 associated disorders 
also include disorders of the musculoskeletal system such as paralysis and muscle weakness, 
e.g., ataxia, myotonia, and myokymia. 
25 54414 or 53763 associated or related disorders also include hormonal disorders, such 

as conditions or diseases in which the production and/or regulation of hormones in an ' 
organism is aberrant. Examples of such disorders and diseases include type I and type II 
diabetes mellitus, pituitary disorders (e.g. , growth disorders), thyroid disorders (e.g. , 
hypothyroidism or hyperthyroidism), and reproductive or fertility disorders (e.g. , disorders 
30 which affect the organs of the reproductive system, e.g., the prostate gland, the uterus, or the 
vagina; disorders which involve an imbalance in the levels of a reproductive hormone in a 
subject; disorders affecting the ability of a subject to reproduce; and disorders affecting 
secondary sex characteristic development, e.g., adrenal hyperplasia). 

54414 or 53763 associated or related disorders also include immune disorders, such 
as autoimmune disorders or immune deficiency disorders, e.g., congenital X-linked infantile 
hypogammaglobulinemia, transient hypogammaglobulinemia, common variable 
immunodeficiency, selective IgA deficiency, chronic mucocutaneous candidiasis, or severe 
combined immunodeficiency. 
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As used interchangeably herein, a "phospholipid transporter associated disorder" or a 
"67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt associated disorder" includes a disorder, 
disease or condition which is caused or characterized by a misregulation (e.g. , 
downfegulation or upregulation) of 67076, 67102, 44181, 67084FL, or 67084alt activity. 
5 67076, 67102, 44181, 67084FL, or 67084alt associated disorders can detrimentally affect 
cellular functions such as cellular proliferation, growth, differentiation, inter- or intra- 
cellular communication; tissue function; such as cardiac, function or musculoskeletal 
function; systemic responses in an organism, such as nervous system responses, hormonal 
responses {e.g., insulin response), or immune responses; and protection of cells from toxic 
1 0 compounds (e. g. , carcinogens^ toxins, or mutagens)., Examples of 67076, 67 1 02, 44 1 8 1 , 
i 67084FL, or 67084alt associated disorders include CNS disorders such as cognitive and 1 
neurodegenerative disorders, examples of which include, but are not limited to, Alzheimer' s 
disease, dementias related to Alzheimer's disease (such as Pick's disease), Parkinson's and 
' other Lewy diffuse body diseases, senile dementia, Huntington's disease, Gilles de la 
15 Tourette's syndrome, multiple sclerosis, amyotrophic lateral sclerosis, progressive 

supranuclear palsy, epilepsy, seizure disorders, and Jakob-Creutzfieldt disease; autonomic 
function disorders such as hypertension and sleep disorders, and neuropsychiatry disorders, 
such as depression, schizophrenia, schizoaffective disorder* korsakoff s psychosis, mania, 
anxiety disorders, or phobic disorders; learning or memory disorders, e.g., amnesia or age- 
20 related memory loss, attention deficit disorder, dysthymic disorder, major depressive 
disorder, mania, obsessive-compulsive disorder, psychoactive substance use disorders, 
anxiety, phobias, panic disorder, as well as bipolar affective disorder, e.g., severe bipolar 
affective (mood) disorder (BP-1), and bipolar affective neurological disorders, e.g., migraine 
and obesity. Further CNS-related disorders include, for example, those listed in the 
25 American Psychiatric Association's Diagnostic and Statistical manual of Mental Disorders 
(DSM), the most current version of which is incorporated herein by reference in its entirety. 

Further examples of 67076, 67 1 02, 44 18 1 , 67084FL, or 67084alt associated 
disorders include cardiac-related disorders. Cardiovascular system disorders in which the 
67076, 67102, 44181, 67084FL, or 67084alt molecules of the invention may be directly or 
30 indirectly involved include arteriosclerosis, ischemia reperfusion injury, restenosis, arterial 
inflammation, vascular wall remodeling, ventricular remodeling, rapid ventricular pacing, 
' coronary microembolism, tachycardia, bradycardia, pressure overload, aortic bending, 
coronary artery ligation, vascular heart disease, atrial fibrilation, Jervell syndrome, Lange- 
Nielsen syndrome, long-QT syndrome, congestive heart failure, sinus node dysfunction, 
35 angina, heart failure, hypertension, atrial fibrillation, atrial flutter, dilated cardiomyopathy, 
idiopathic cardiomyopathy, myocardial infarction, coronary artery disease, coronary artery 
spasm, and arrhythmia. 67076, 67102, 44181, 67084FL, or 67084alt associated disorders 
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also include disorders of the musculoskeletal system such as paralysis and muscle weakness, 
e.g., ataxia, myotonia, and myokymia. 

67076, 67102, 44181, 67084FL, or 67084alt associated disorders also include 
cellular proliferation, growth, or differentiation disorders. Cellular, proliferation, growth, or 
5 differentiation disorders include those disorders that affect cell proliferation, growth, or 
differentiation processes. As used herein, a "cellular proliferation, growth, or differentiation 
process" is a process by which a cell increases' in number, size or content, or by which a cell 
develops a specialized set of characteristics which differ froth that of other cells. The 
67076, 67102, 44181, 67084FL, or 67084alt molecules of the present invention-are involved 

10 in phospholipid transport mechanisms, which are known to be iiivolved in cellular growth, 
1 proliferation, and differentiation.processes. Thus, the 67076, 67102, 44181, 67084FL, or 1 
67084alt molecules may modulate cellular growth, proliferation, or differentiation, and may 
play a role in disorders characterized by aberrantly regulated growth, proliferation, or 
differentiation. Such disorders include cancer, e.g., carcinoma, sarcoma, or leukemia; tumor 

15 angiogenesis and metastasis; skeletal dysplasia; hepatic disorders; and hematopoietic and/or 
myeloproliferative disorders. 

67076, 67102, 44181, 67084FL, or 67084alt associated or related disorders also 
include hormonal disorders, such as conditions or diseases in which the production and/or 
regulation of hormones in an organism is aberrant. Examples of such disorders and diseases 

20 include type I and type II diabetes mellitus, pituitary disorders (e.g. , growth disorders), 
thyroid disorders (e.g. , hypothyroidism or hyperthyroidism), and reproductive or fertility 
disorders {e.g. , disorders which affect the organs of the reproductive system, e.g. , the 
prostate gland, the uterus, or the vagina; disorders which involve an imbalance in the levels 
of a reproductive hormone in a subject; disorders affecting the ability of a subject to 

25 reproduce; and disorders affecting secondary sex characteristic development, e.g., adrenal 
hyperplasia). 

67076, 67102, 44181, 67084FL, or 67084alt associated or related disorders also 
include immune disorders, such as autoimmune disorders or immune deficiency disorders, 
e.g., congenital X-linked infantile hypogammaglobulinemia, transient 
30 hypogammaglobulinemia, common variable immunodeficiency, selective IgA deficiency, 
chronic mucocutaneous candidiasis, or severe combined immunodeficiency. 

8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt associated 
or related disorders also include disorders affecting tissues in which 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt protein is expressed. 
35 In addition, the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

67084alt polypeptides can be used to screen for naturally occurring 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt substrates, to screen for drugs or 
compounds which modulate 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
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67084alt activity, as well as to treat disorders characterized by insufficient or excessive 
production of 8099, 46455/54414, 53763, 67076; 67102, 44181,67084Fi:, or 67084alt 
polypeptide or production of 8099, 46455, 54414, 53763, 67076, 67102, 44181/67084FL, 
or 67084altpolypeptide forms which have decreased, aberrant or unwanted activity 
5 compared to 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt wild 
type polypeptide (e.g. , sugar transporter associated disorder, potassium channel associated 
diosrders, a phospholipid transporter-associated disorders). Moreover, the antr-8099, 46455, 
54414, 53763, 67076, 67162, 44181, 67084FL, or 67084alt antibodies of the invention can 
be used to detect and isolate 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
10 67084alt polypeptides,' to regulate the bioavailability of 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptides, and modulate 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or. 67084alt activity. . 

A. Screening Assays . 

15 The invention provides a method (also referred to herein as a "screening assay") for 

identifying modulators, i.e., candidate or test compounds or agents (e.g., peptides, 
. peptidomimetics, small molecules or other drugs) which bind to 8099, 46455, 54414, 53763, 
67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt polypeptides, have a stimulatory or inhibitory 
effect on, for example, 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or ' 

20 67084alt expression or 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 
67084alt activity, or have a stimulatory or inhibitory effect on, for example, the expression 
or activity of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
substrate. 

In one embodiment, the invention provides assays for screening candidate or test 
25 compounds which are substrates of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide or polypeptide or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of a 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide or polypeptide or biologically active 
30 portion thereof. The test compounds of the present invention can be obtained using any of 
the numerous approaches in combinatorial library methods known in the art, including: 
biological libraries; spatially addressable parallel solid phase or solution phase libraries; 
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library 
method; and synthetic library methods using affinity chromatography selection. The 
35 biological library approach is limited to peptide libraries, while the other four approaches 
are applicable to peptide, non-peptide oligomer or small molecule libraries of compounds 
(Lam, K.S. (1997) Anticancer DrugDes. 12:145). 



91 



BNSDOCID: <WO__02055701A2J_> 



WO 02/055701 PCT/US01/49060 

a I . ■ . 

Examples of methods for the synthesis of molecular libraries can be found in the art, 
• for example in: DeWitt et al. (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb etal. 
(1994) Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al. (1994). J. Med Chem. 
37:2678; Cho et al. (1993) Science 261:1303; Carrell et al. (1994) Angew. Chem. Int. Ed. 
5 Engl. 33:2059; Carell et al. (1994) Angew. Chem: Int.' Ed. Engl. 33:2061 ; and in Gallop et 
al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g. , Houghten (1992) 
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor 
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP '409), 
10 plasmids (Cull et al. (1992) Proc NatlAcadSci USA 89: 1865-1869) or on phage (Scott and 
i Smith (1 990) Science 249:386-390); (Devlin (1 990) Science 249:404-406); (Cwirla et al. • 
(1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J: Mol Biol. 222:301-310); 
(Ladner supra.). 

In one embodiment, an assay is a cell-based assay in which a cell which expresses a 
15 8099, 46455, 544 1 4, 53763, 67076, 67 1 02, 44 1 8 1 , 67084FL, pr 67084alt polypeptide or. 
biologically active portion thereof is contacted with a test compound and the ability of the 
test compound to modulate 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt activity is determined. 

Determining the ability of the test compound to modulate 8099 or 46455 activity can 

• ' ' ' • 

20 be accomplished by monitoring, for example, intracellular or extracellular D-glucose, D- 
fructose, D-galactose, and/or mannose concentration, or insulin or glucagon secretion. The 
cell, for example, can be of mammalian origin, e.g., a liver cell, fat cell, muscle cell, or a 
blood cell, such as an erythrocyte. 

Determining the ability of the test compound to modulate 54414 or 53763 acti vity 

25 can be accomplished by monitoring, for example, potassium current, neurotransmitter 

release, and/or membrane excitability in a cell Which expresses 54414 or 53763. The cell, 
for example, can be of mammalian origin, e.g., a neuronal cell. 

Determining the ability of the test compound to modulate 67076, 67102, 441 81, 
67084FL, or 67084alt activity can be accomplished by monitoring, for ex,ample, (i) 

30 interaction of 67076, 67102, 441 81, 67084FL, or 67084alt with a 67076, 67102, 44181, 
67084FL, or 67Q84alt substrate or target molecule (e.g., a phospholipid, ATP, or a non- 
' 67076, 67 1 02, 44 1 8 1 9 67084FL, or 67084alt protein); (ii) transport of a 67076, 67 1 02, 
44 1 8 1 , 670 84FL, or 67084alt substrate or target molecule {e.g. , an aminophospholipid such 
as phosphatidylserine or phosphatidylethanolamine) from one side of a cellular membrane 

35 to the other; (iii) the ability of 67076, 67102, 44181, 67084FL, or 67084alt to be 

phosphorylated or dephosphorylated; (iv) adoption by 67076, 67102, 44181, 67084FL, or 
67084alt of an El conformation or an E2 conformation; (v) conversion of a 67076, 67102, 
441 81, 67084FL, or 67084alt substrate or target molecule to a product {e.g., hydrolysis of 
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ATP); (vi) interaction of 67076, 67102, 44181, 67084FL, or 67084alt with a second non- 
67076, 67 1 02, 44 1 8 1 ,' 67084FL 1 ; or 67084alt protein; (vii) modulation of Substrate or target 
molecule location (e.g.,, modulation of phospholipid location within a cell and/or location 
with respect to a cellular membrane); (viii) maintenance of aminophospholipid gradients; 
5 (ix) modulation of intra- or intercellular signaling and/or gene transcription (e.g., either 
directly of indirectly); and/or (x) modulation of cellular proliferation, growth, 
differentiation, apoptosis, absorption, and/or secretion. ' •- 

The ability of the test compound to modulate' 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt binding to a substrate or to bind to 8099, 46455, 

10 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt can also be determined. 

i 1 

Determining the ability of the test compound to modulate 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt binding to a substrate can be accomplished, for 
example, by coupling the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt substrate with a radioisotope or enzymatic label such that binding of the 8099, 
15 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt substrate to 8099, . 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt can be determined by 
detecting the labeled 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

i 

67084alt substrate in a complex. Alternatively, 8099, 46455, 54414, 53763, 67076, 67102, 
44 1 81 , 67084FL, or 67084alt could be coupled with a radioisotope or enzymatic label to 

20 monitor the ability of a test compound to modulate 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt binding to a 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt substrate in a complex. Determining the ability of the 
test compound to bind 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt can be accomplished, for example, by coupling the 8099, 46455, 54414, 53763, 

25 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt substrate with a radioisotope or enzymatic 
label such that binding of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt substrate to 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt can be determined by detecting the labeled 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt substrate in a complex. Alternatively, 8099, 46455, 

30 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt could be coupled with a 

radioisotope or enzymatic label to monitor the ability of a test compound to modulate 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt binding to a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt substrate in a complex. 
Determining the ability of the test compound to bind 8099, 46455, 54414, 53763, 67076, 

35 67 1 02, 44 1 8 1 , 67084FL, or 67084alt can be accomplished, for example, by coupling the 
compound with a radioisotope or enzymatic label such that binding of the compound to 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt can be determined 
by detecting the labeled 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
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67084alt compound in a complex. For example, compounds (e.g. , $099, 46455, 544 14, 
53 763, 67076, 67 1 02/ 44 1 8% 67084FL, or 67084alt substrates) can be labeled with } 25*1, 
35 S, 1 4 C, or 3h, either directly or indirectly, and the.radioisotope deteqted by direct 
counting of radioemmission or by scintillation counting. Alternatively, compounds can be 
5 enzymatically labeled with, for example, horseradish peroxidase, alkaline phosphatase, or 
luciferase, and the enzymatic label detected by determination of conversion of an 
appropriate substrate to product. ' 

It is also within the scope of this 1 invention to' determine the ability of a compound 
(eg.,-* 8099, 464.55, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt substrate) 

10 to interact with 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
without the labeling of any of the interactants. For example, a microphysiometer can be 
used to detect the interaction of a compound with 8099, 46455, 54414, 53763, 67076 
67102, 44181, 67084FL, or 67084alt without the labeling of either the compound or the 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . McConnell, H. 

15 M. et al. (1992) Science 257:1906-1912. As used herein, a "microphysiometer" (e.g., 
Cytosensor) is an analytical instrument that measures the rate at which a cell acidifies its 
environment using a light-addressable potentiometric sensor (LAPS). Changes in this 
acidification rate can be used as an indicator of the interaction between a compound and 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt . 

20 In another embodiment, an assay is a cell-based assay comprising contacting a cell 

expressing a 8099, 46455, 54414, 53763, 67076, 67102, 441 81 , 67084FL," or 67084alt 
target molecule (e.g., a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt substrate) with a test compound and determining the ability of the test compound 
to modulate (e.g., stimulate or inhibit) the activity of the 8099, 46455, 54414, 53763,67076, 

25 67102, 44181, 67084FL, or 67084alt target molecule. Determining the ability of the test 
compound to modulate the activity of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt target molecule can be accomplished, for example, by determining 
the cellular location of the target molecule, or by determining whether the target molecule 
(e.g., ATP) has been hydrolyzed. 

30 Determining the ability of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

67084FL, or 67084alt polypeptide, or a biologically active fragment thereof, to bind to or 
interact with a 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt 
target molecule can be accomplished by one of the methods described above for determining 
direct binding. In a preferred embodiment, determining the ability of the 8099, 46455, 

35 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide to hind to or 
interact with a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
target molecule can be accomplished by determining the activity of the target molecule. For 
example, the activity of the target molecule can be determined by detecting the cellular 
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location of target molecule, detecting catalytic/enzymatic activity of the target molecule 
upon an appropriate substrate, detecting induction of a metabolite of the target molecule 
{e.g., detecting the products of ATP hydrolysis, changes in intracellular K + levels) detecting 
the induction of a reporter gene (comprising a target-responsive regulatory element 

5 operatively linked to a nucleic acid encoding a detectable marker, e.g., luciferase), or 
detecting a target-regulated cellular response (i.e., membrane excitability, or cell growth, 
proliferation, differentiation, or apoptosis, sugar transport). ; 

In yet another embodiment, an assay of the present invention is a cell-free assay in 
which a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

10 polypeptide or biologically active portion thereof is contacted with a test compound and the 
• ability of the test compound to bind to the 8099, 46455, 54414, 53763, 67076, 67102, 1 
44 1 8 1 , 67084FL, or 67084alt polypeptide or biologically active portion thereof is 
determined. Preferred biologically active portions of the 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt polypeptides to be used in assays of the 

15 present invention include fragments which participate in interactions with non-8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 3 molecules, e.g., fragments 
with high surface probability scores (see, for example, Figures 2, 9, 13, 17, 21, 25, 29, 33, 
and 37). Binding of the test compound to the 8099, 46455, 54414, 53763, 67076, 67102, 
441 8 1 , 67084FL, or 67084alt polypeptide can be determined either directly or indirectly as 

20 described above. In a preferred embodiment, the assay includes contacting the 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or biologically 
active portion thereof with a known compound which binds 8099, 46455, 54414, 53763, 
67076, 67102, 441 81, 67084FL, or 67084alt to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability of the test compound to interact 

25 With a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

polypeptide, wherein determining the ability of the testxiompound to interact with a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide comprises 
determining the ability of the test compound to preferentially bind to 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt or biologically active portion thereof as 

30 compared to the known compound. 

In another embodiment, the assay is a cell-free assay in which a 8099, 46455, 54414, 
' 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or biologically active 
portion thereof is contacted with a test compound and the ability of the test compound to 
modulate (e.g., stimulate or inhibit) the activity of the 8099, 46455, 54414, 53763, 67076, 

35 671 02, 44 1 8 1 , 67084FL, or 67084alt polypeptide or biologically active portion thereof is 
determined. Determining the ability of the test compound to modulate the activity of a 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide can 
. be accomplished, for example, by determining the ability of the 8099, 46455, 54414, 53763, 
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67076, 67102, 44181, 67084FL, or 67084alt polypeptide to bind to a 8099, 46455, 54414 
53763, 67076, 67102/44181, 67u84FL, or 67084alt target molecule by one of the methods 
described above for determining direct binding. Determining the ability of the 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide to bind to a 8099 ' 
5 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt target molecule can also 
be accomplished using a technology such as real-time Biomolecular Interaction Analysis 
(BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal. Ctiem. 63:2338-2345 and Szabo et al. 
(1995) Curr. Opin. Struct. Biol 5:699-705. As used herein, "BIA" is a technology for : 
studying biospecific interactions in real time, without labeling any of the interactants (e.g., 
10 BlAcore). Changes in the optical phenomenon of surface plasmon resonance (SPR) can be 
used as an indication of real-time reactions between biological molecules. 

Iri an alternative embodiment, determining the ability of the test compound to , 

modulate the activity of a 8099, 46455, 54414, 53763, 67076, 67)02, 44181, 67084FL, or 
67084alt polypeptide can be accomplished by determining the ability of the 8099, 46455 

15 54414, 53763, 67076, 67102, 44181 , 67084FL, or 67084alt polypeptide to further modulate 
the activity of a downstream effector of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt target molecule. For example, the activity of the effector molecule 
on an appropriate target can be determined or the binding of the effector to an appropriate 
target can be determined as previously described. 

20 In yet another embodiment, the cell-free assay involves contacting a 8099, 46455 

54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or biologically ' 
active portion thereof with a known compound which hinds the 8099, 46455, 54414, 53763 
67076, 67102, 44181, 67084FL, or 67084alt polypeptide to form an assay mixture, 
contacting the assay mixture with a test compound, and determining the ability of the test 

25 compound to interact with the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide, wherein determining the ability of the test compound to interact ' 
with the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide comprises determining the ability of the 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide to preferentially bind to or modulate the 

30 activity of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt target 
molecule. 

The cell-free assays of the present invention are amenable to use of both soluble 
and/or membrane-bound forms of isolated proteins (e.g., 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt proteins or biologically active portions thereof 
35 ). In the case of cell-free assays in which a membrane-bound form of an isolated protein is 
used it may be desirable to utilize a solubilizing agent such that the membrane-bound form 
of the isolated protein is maintained in solution. Examples of such solubilizing agents 
include non-ionic detergents such as n-octylglucoside, n-dodecylglucoside, n- 
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dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, Triton® X- 
100, Triton® X-l 14, Thesit®, Isotridecypoly(ethylene glycol ether) n , 3-[(3- 
cholarrndopropyl)dimethylarnminio]-l -propane sulfonate (CHAPS), 3-[(3- 
cholamidopropyl)dimethylarnminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or N- 
5 dodecyl=N,N-dimethyl-3-anunonio-l -propane sulfonate. 

In more than one embodiment of the above assay methods of the present invention, it 
may be desirable to immobilize either 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt or its target molecule to facilitate separation of complexed from 
uncomplexed forms of one or both of the proteins, as well as to accommodate, automation of 
10 the assay. Binding of a test cdmpound to a 8099, 46455, 54414,'53763, 67076, 67102, 
' 44181, 67084FL, or 67084alt polypeptide, or interaction of a 8099, 46455, 54414, 53763,' 
67076, 67 1 02, 44 18 1 , 67084FL, or 67084alt polypeptide with a target molecule in the 
presence and absence of a candidate compound, can be accomplished in any vessel suitable 
for containing the reactants. Examples of such vessels include microtiter plates, test tubes, 
15 arid micro-centrifuge tubes. In one embodiment, a fusion protein can be provided which 
adds a domain that allows one or both of the proteins to be bound to a matrix. For example, 
glutathione-S-transferase/8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt fusion proteins or glutathione-S-transferase/target fusion proteins can be adsorbed 
onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
20 derivatized micrometer plates, which are then combined with the test compound or the test 
compound and either the non-adsorbed target protein or 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide, and the mixture incubated under 
conditions conducive to complex formation (e.g. , at physiological conditions for salt and 
pH). Following incubation, the beads or micrometer plate wells are washed to remove any 
25 unbound components, the matrix immobilized in the case of beads, complex determined 
either directly or indirectly, for example, as described above. Alternatively, the complexes 
can be dissociated from the matrix, and the level of 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt binding or activity determined using standard 
techniques. 

30 Other techniques for immobilizing proteins on matrices can also be used in the 

screening assays of the invention. For example, either a 8099, 46455, 54414, 53763, 67076, 
' 67102, 44181, 67084FL, or 67084alt polypeptide or a 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt target molecule can be immobilized utilizing 
conjugation of biotin and streptavidin. Biotinylated 8099, 46455, 54414, 53763, 67076, 

35 67102, 44181, 67084FL, or 67084alt polypeptide, substrate, or target molecules can be 

prepared from biotin-NHS (N-hydroxy-succinimide) using techniques known in the art (e.g., 
biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the wells of 
streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive with 
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8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or 
target molecules but which do not interfere with binding of the 8099, 46455, 54414, 53763,, 
67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt polypeptide to its target molecule can be 
derivsttized to the wells of the plate, and unbound target or 8099, 46455, 54414, 53763, 
5 67076, 67102, 44181, 67084FL, or 67084alt polypeptide trapped in the wells by antibody 
conjugation. Methods for detecting such complexes, in addition to those described above 
for the GST-immobilized complexes, include immunodetection of complexes using 
antibodies reactive with the 8099, 46455, 54414,i53763, 67076, 67102, 441 Si, 67084FL, or 
67084alt polypeptide or target molecule, as well as enzyme-linked assays which rely on 
10 detecting an enzymatic activity associated with the 8099, 46455,- 54414, 53763, 67076, 
i 67102, 44181, 67084FL, or 67084alt polypeptide or target molecule. " • " 

In another embodiment, modulators of 8099, 46455, 54414, 53763, 67076, 67102, 
44 1 8 1 , 67084FL, or 67084alt expression are identified in a method wherein a cell is 

i i 

contacted with a candidate compound and the expression of 8099, 46455, 54414, 53763, 

15 67076, 67102, 44181, 67084FL, or 67084alt mRNA or polypeptide in the cell is 

determined. The level of expression of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt mRNA or polypeptide in the presence of the candidate compound is 
compared to the level of expression of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt mRNA or polypeptide in the absence of the candidate compound, 

20 The candidate compound can then be identified as a modulator of 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt expression based on this comparison. 
For example, when expression of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt mRNA or polypeptide is greater (statistically significantly greater) in 
the presence of the candidate compound than in its absence, the candidate compound is 

25 identified as a stimulator of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt mRNA or polypeptide expression. Alternatively, when expression of 8099, 
. 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA or polypeptide is 
less (statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of 8099, 46455, 54414, 53763, 

30 67076, 67102, 44181, 67084FL, or 67084alt mRNA or polypeptide expression. The level 
of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA or 
polypeptide expression in the cells can be determined by methods described herein for 
detecting 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA 
or polypeptide. 

35 In yet another aspect of the invention, the 8099, 46455, 544 14, 53763, 67076, 671 02, 

441 81, 67084FL, or 67084alt polypeptides can be used as "bait proteins" in a two-hybrid 
assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,317; Zervos et al. (1993) Cell 
72:223-232; Madura et al (1993) J. Biol Chem. 268:12046-12054; Bartei et al (1993) 
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Biotechniques 14:920-924; Iwabuchi et al (1993) Oncogene 8:1693-1696; and Brent 
WO94/10300), to identify other proteins, which bind to or interact with $099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt ("8099, 46455, 544 M, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt -binding proteins" or "8099, 46455, 54414, 53763, 
5 67076, 67102, 441 81, 67084FL, or 67084alt -bp") and are involved in 8099, 46455, 54414, 
53763,67076, 67102, 44181, 67084FL, or 67084alt activity. Such 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt -binding proteins are also likely to be 
involved in the propagation of signals by the 8099, 46455, 544 1 4, 53763, 67076, 67 1 02, 
44181, 67084FL,-or 67084alt polypeptides or 8099, 46455, 54414, 53763, 67076, 67102, 

10 44 1 8 1 , 67084FL, or 67084alt targets as, for example, downstream elements of a 8099, 
46455, 544 14, 53763,' 67076, 67102, 441 8 1 , 67084FL, or 67084alt -mediated signaling 
pathway.' Alternatively, such 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt. -binding proteins are likely to be 8099, 46455, 54414, 53763, 67076, 67102, 
44 1 8 1 , 67084FL, or 67084alt inhibitors. • 

15 The two-hybrid system is based on the modular nature of most transcription factors, 

which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 
two different DNA constructs. In one cpnstruct, the gene that codes for a 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide is fused to a gene 
encoding the DNA binding domain of a known transcription factor (e.g. , GAL-4). In the 

20 other construct, a DNA sequence, from a library of DNA sequences, that encodes an 
unidentified protein ("prey" or "sample") is fused to a gene that codes for the activation 
domain of the known transcription factor. If the "bait" and the "prey" proteins are able to 
interact, in vivo, forming a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt -dependent complex, the DNA-binding and activation domains of the transcription 

25 factor are brought into close proximity. This proximity allows transcription of a reporter 
gene (e.g. , LacZ) which is operably linked to a transcriptional regulatory site responsive to 
the transcription factor. Expression of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be. isolated and used to obtain the cloned 
gene which encodes the protein which interacts with the 8099, 46455, 54414, 53763, 67076, 

30 67102, 441 81 , 67084FL, or 67084alt polypeptide. 

In another aspect, the invention pertains to a combination of two or more of the 
assays described herein. For example, a modulating agent can be identified using a cell- 
based or a cell free assay, and the ability of the agent to modulate the activity of a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide can be 

35 confirmed in vivo, e.g., in an animal such as an animal model for cellular transformation 
and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an agent 
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identified as described herein in an appropriate animal model. For example, an agent 
identified as described* herein (e.g., a 8099, 46455, 54414, 53763*, 67076, 67102, 44181, 

67084FL, or 67084alt modulating agent, an antisense 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt nucleic acid molecule, a 8099, 46455, 54414, 53763, 
5 67076, 67102, 44181, 67084FL, or 67084alt -specific antibody, or a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt -binding partner) can be used in an 
animal model to determine the efficacy, toxicity, or side effects of treatment with such an 
agent. Alternatively, an agent identified as described herein can be Used in an animal model 
to determine the mechanism of action of such an agent. Furthermore, this invention pertains 
10 to uses of novel agents identified by the above-described screening assays for treatments as 
described herein. ■■' ■ 

B. . Detection Assays 

Portions or fragments of the cDNA sequences identified herein (and the 
15 corresponding complete gene sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: (i) map their respective genes on a 
chromosome; and, thus, locate gene regions associated with genetic disease; (ii) identify an 
individual from a minute biological sample (tissue typing); and (iii) aid in forensic 
identification of a biological sample. These applications are described in the subsections 
20 below. 

1 . Chromosome Mapping 

Once the sequence (or a portion Of the sequence) of a gene has been isolated, this 
sequence can be used to map the location of the gene on a chromosome. This process is 
25 called chromosome mapping. Accordingly, portions or fragments of the 8099, 46455, 

54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleotide sequences, described 

herein, can be used to map the location of the 8099, 46455, 54414, 53763, 67076, 67102, 
44181,67084FL, or67084alt genes on a chromosome. The mapping of the 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt sequences to chromosomes is 

30 an important first step in correlating these sequences with genes associated with disease. 

Briefly, 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
genes can be mapped to chromosomes by preparing PCR primers (preferably 15-25 bp in 
length) from the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
nucleotide sequences. Computer analysis of the 8099, 46455, 54414, 53763, 67076, 67102, 

35 44 1 81 , 67084FL, or 67084alt sequences can be used to predict primers that do not span 
more than one exon in the genomic DNA, thus complicating the amplification process. 
These primers can then be used for PCR screening of somatic cell hybrids containing 
individual human chromosomes. Only those hybrids containing the human gene 
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corre'sponding to the 80.99, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing somatic ^ cells from different mammals 
(e.g., human and mouse cells). As hybrids of human and mouse cells grow and divide, they 

5 gradually lose human chromosomes in random order/but retain the mouse chromosomes; 
By using media in which mouse cells cannot grow, because they lack a particular enzyme, 
but human cells can, the one human chromosome that contains the gene encoding the needed 
enzyme, will be retained. By using various media, panels of hybrid cell lines can be 
established. Each cell line in a panel contains either a single human chromosome or a small 

10 number of human chromosonies, and a full set of mouse chromosomes, allowing easy 
■ mapping of individual genes to specific human chromosomes (D'Eustachio P. et ai (1983) 
Science 220:919-924). Somatic cell hybrids containing only fragments of human 

chromosomes can also be produced by using human chromosomes with translocations and 

i » . ' /. . . 

deletions. 

15 PCR mapping of somatic cell hybrids is t a rapid procedure for assigning a particular 

sequence to a particular chromosome. Three or more sequences can be assigned per day 
using a single thermal cycler. Using the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt nucleotide sequences to design oligonucleotide primers, 
sublocalization can be achieved with panels of fragments from specific chromosomes. 
20 Other mapping strategies which can similarly be used to map a 8099, 46455, 54414, 53763, 
67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt sequence to its chromosome include in situ 
hybridization (described in Fan, Y. et al (1990) Proc. Natl Acad Set USA, 87:6223-27), 
pre-screening with labeled flow-sorted chromosomes, and pre-selection by hybridization to 
chromosome specific cDNA libraries. 
25 Fluorescence in situ hybridization (FISH) of a DNA sequence to a metaphase 

chromosomal spread can further be used to provide a precise chromosomal location in one 
• step. Chromosome spreads can be made using cells whose division has been blocked in 
metaphase by a chemical such as colcemid that disrupts the mitotic spindle. The 
chromosomes can be treated briefly with trypsin, and then stained with Gjemsa. A pattern 
30 of light and dark bands develops on each chromosome, so that the chromosomes can be 
identified individually. The FISH technique can be used with a DNA sequence as short as 
500 or 600 bases. However, clones larger than 1 ,000 bases have a higher likelihood of 
binding to a unique chromosomal location with sufficient signal intensity for simple 
detection. Preferably 1 ,000 bases, and more preferably 2,000 bases will suffice to get good 
35 results at a reasonable amount of time. For a revie w of this technique, see Verma et al. , 
Human Chromosomes: A Manual of Basic Techniques (Pergamon Press, New York 1988). 

Reagents for chromosome mapping can be used individually to mark a single 
chromosome or a single site on that chromosome, or panels of reagents can be used for 
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marking multiple sites and/or multiple chromosomes. Reagents corresponding to noncoding 
regions of the genes actually are preferred for mapping purposes. Coding sequences are . 
more likely to be conserved within gene families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 
5 Once a sequence has been mapped to a precise chromosomal location, the physical 

position of the sequence on the chromosome can be correlated with genetic map data. (Such 
data are found, for example, in V. McKu'sick, Mendelian Inheritance in Man, available on- 
line through Johns Hopkins University Welch Medical Library). The relationship between a 
gene and a disease, mapped to the same chromosomal region, can then be identified through 

1 0 linkage analysis (co-inheritance of physically adjacent genes), described in, for example, 
i Egeland, J; et al. (1 987) Nature, 325:783-787. 

Moreover, differences in the DNA sequences between individuals affected and 
unaffected with a disease associated with the 8099, 46455, 54414, 53763, 67076, 67102, 
44 1 8 1 , 67084FL, or 67084alt gene, can be determined. If a mutation is observed in some or 

15 all of the affected individuals but not in any unaffected individuals, then the mutation is 
likely to be the causative agent of the particular disease. Comparison of affected and 
unaffected individuals generally involves first looking for structural alterations in the 
chromosomes, such as deletions or translocations that are visible from chromosome spreads 
or detectable using PCR based on that DNA sequence. Ultimately, complete sequencing of 

20 genes from several individuals can be performed to confirm the presence of a mutation and 
to distinguish mutations from polymorphisms. 

2. Tissue Typing 

The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
25 sequences of the present invention can also be used to identify individuals from minute 
biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In this 
technique, an individual's genomic DNA is digested with one or more restriction enzymes, 
and probed on a Southern blot to yield unique bands for identification. This method does 
30 not suffer from the current limitations of "Dog Tags" .which can be lost, switched, or stolen, • 
making positive identification difficult. The sequences of the present invention are useful as 
additional DNA markers for RFLP (described in U.S. Patent 5,272,057). 

. Furthermore, the sequences of the present invention can be used to provide an 
alternative technique which determines the actual base-by-base DNA sequence of selected 
35 portions of an individual's genome. Thus, the 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt nucleotide sequences described herein can be used to prepare 
two PCR primers from the 5' and 3' ends of the sequences. These primers can then be used 
to amplify an individual's DNA and subsequently sequence it. 
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Panels of corresponding DNA sequences from individuals, prepared in this manner, 

can provide unique individual identifications, as each individual will have! a unique set of 

such DNA sequences due to allelic differences. The sequences of the present invention can 

be used to obtain such identification sequdrices from individuals and from tissue/ The 8099, 

5 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt nucleotide sequences of 

the invention uniquely represent portions of the human genome. Allelic variation occurs to 

some degree in the coding regions of these sequences, arid to a greater degree in the 
■■ ■ • • . ■ ■ , 

noncoding regions. It is estimated that Allelic variatibn between individual humans occurs 

with a frequency ,of about once per each 500 bases. Each of the sequences described herein 

10 can, to some degree, be used as a standard against which DNA from an individual can be 

compared for identification purposes. Because greater numbers of polymorphisms occur in 

the noncoding regions, fewer sequences are necessary to differentiate individuals. The 

noncoding sequences of SEQ ID NO: 1 or SEQ ID NO:4 can comfortably provide positive 

individual identification with a panel of perhaps 10 to 1,000 primers which each yield a 

15 noncoding amplified sequence pf 100 bases. If predicted coding sequences, such as those in 

SEQ ID NO:3 or SEQ ID NO:6 are used, a more appropriate number of primers for positive 

individual identification would be 500-2,000. 

If a panel of reagents from 8099," 46455, 54414, 53763, 67076, 67102, 44181, 

67084FL, or 67084alt nucleotide sequences described herein is used to generate a unique 

20 identification database for an individual, those same reagents can later be used to identify 

tissue from that individual. Using the unique identification database, positive identification 

of the individual, living or dead, can be made from extremely small tissue samples. 

3. Use of 8099, 46455, 54414, 53763, .67076, 67102, 44181, 67084FL, or 67084alt 

25 Sequences in Forensic Biology 

DNA-based identification techniques can also bp used in forensic biology. Forensic 
biology is a scientific field employing genetic typing of biological evidence found at a crime 
scene as a means for positively identifying, for example, a perpetrator of a crime. To make 
such an identification, PCR technology can be used to amplify DNA sequences taken from 

30 very small biological samples such as tissues, e.g., hair or skin, or body fluids, e.g., blood, 
saliva, or semen found at a crime scene. The amplified sequence can then be compared to a 
standard, thereby allowing identification of the origin of the biological sample. 

The sequences of the present invention can be used to provide polynucleotide 
reagents, e.g., PCR primers, targeted to specific loci in the human genome, which can 

35 enhance the reliability of DNA-based forensic identifications by, for example, providing 
another "identification marker" (i.e. another DNA sequence that is unique to a particular 
individual). As mentioned above, actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by restriction enzyme generated 
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fragments. Sequences targeted to noncoding regions of SEQ ID NO: 1 or SEQ ID NO:4 are 
particularly appropriate for this use as greater numbers of polymorphisms occur in the 
noncoding regions, making it easier to differentiate individuals using this technique. 
Examples of polynucleotide reagents include the 8099^ 46455, 544.14, 53763, 67076, 671 02, 
5 44181, 67084FL, or 67084alt nucleotide sequences or portions thereof, e.g., fragments 
derived from the noncoding regions of SEQ ID NO:l, SEQ ID NO:4, SEQ ID NO:7, SEQ 
ID NO: 1 0, or SEQ ID NO; 1 3, having a length' of at least 20 bases, preferably at least 30 
bases. , i 

The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
1 0 nucleotide sequences described herein can further be Used to provide polynucleotide 
!. reagents, e.g. , labeled or labelabje probes which can be used in, for example, an in situ ' 
hybridization technique, to identify a specific tissue, e.g. , brain tissue. This can be very 
useful in cases where a forensic pathologist is presented with a tissue of unknown origin. 
Panels of such 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
15 probes can be used to identify tissue by species.and/or by organ type. 

In a similar fashion, these reagents, e.g., 8099, 46455, 54414, 53763, 67076, 67102, 
441 81, 67084FL, or 67084alt primers or probes can be used to screen tissue culture for 
contamination (i.e. screen for the presence of a mixture of different types of cells in a 
culture). 

20 

C. Predictive Medicine : 

The present invention also pertains to the field of predictive medicine in which 

diagnostic assays, prognostic assays, and monitoring clinical trials are used for prognostic 
(predictive) purposes to thereby treat an individual prophylactically. Accordingly, one 

25 aspect of the present invention relates to diagnostic assays for determining 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide and/or nucleic acid 
> expression as well as 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt activity, in the context of a biological sample (e.g. , blood, serum, cells, tissue) to 
thereby determine whether an individual is afflicted with a disease or disorder, Or is at risk 

30 of developing a disorder, associated with aberrant or unwanted 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt expression or activity. The invention also ' 
provides for prognostic (or predictive) assays for determining whether an individual is at 
risk of developing a disorder associated with 8099, 46455, 54414, 53763, 67076, 671 02, 
44181, 67084FL, or 67084alt polypeptide, nucleic acid expression or activity. Forexample, 

35 mutations in a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
gene can be assayed in a biological sample. Such assays can be used for prognostic or 
predictive purpose to thereby prophylactically treat an individual prior to the onset of a 
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disorder characterized by or associated with 8099, 46455, 54414, 53763, 67076, 67102, 
44 1 8 1 , 67084FL, or 67084alt polypeptide* nucleic acid expression or activity. 

Another aspect of the invention pertains to monitoring the influence of agents (e.g. , 
drugs, compounds) on the expression or activity of 8099, 46455, 54414, 53763, 67076, 
5 67102,44181, 67084FL, or 67084alt in clinical trials; 

These and other agents are described in further detail in the following sections, 

1. Diagnostic Assays • 
An exemplary method for detecting the presence or absence of 8099, 4645 5> 544 1 4, 
10 53763, 67076, 67102, 44181, 67p84FL, or 67084alt polypeptide or nucleic acid in a 

biological sample involves obtaining a biological sample from a test subject and contacting 
the biological sample with a compound or an agent capable of detecting 8099, 46455, . 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or nucleic acid 
(e.g. , mRNA, or genomic DNA) that encodes 8099, 46455, 54414, 53763, 67076, 67 1 02, 
15 441 81, 67084FL, or 67084alt polypeptide such that the presence of 8099, 46455, 54414,- 
5 3 763 , 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt polypeptide or nucleic acid is detected 
in the biological sample. In another aspect, the present inventidn provides a method for 
detecting the presence of 8099, 46455, 54414, 53763, 67076,67102, 44181, 67084FL, or 
67084alt activity in a biological sample by contacting the biological sample with an agent 
20 capable of detecting an indicator of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

67084FL, or 67084alt activity such that the presence of 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt activity is detected in the biological sample. A 
preferred agent for detecting 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt mRNA or genomic DNA is a labeled nucleic acid probe capable of hybridizing to 
25 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA or 

genomic DNA. The nucleic acid probe can be, for example, the 8099, 46455, 54414, 53763, 
67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt nucleic acid set forth in SEQ ID NO : 1 , SEQ 
ID NO:3, SEQ ID NG:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, 
SEQ ID NO:12, SEQ IDNO.13, SEQ ID NO:l5, SEQ ID NO:16, SEQ ID NO:18, SEQ ID 
30 NO:19, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO: 1 6, SEQ ID NO: 1 8, SEQ ID NO:19, SEQ ID NO:21 , SEQ ID NO:22, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:27, or the DNA insert of the plasmid deposited with 

ATCC as Accession Number , , _>• _> ° r _ > or a 

portion thereof, such as an oligonucleotide of at least 15, 30, 50, 100, 250 or 500 nucleotides 
35 in length and sufficient to specifically hybridize under stringent conditions to 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 670 84FL, or 67084alf mRNA or genomic DNA. 
Other suitable probes for use in the diagnostic assays of the invention are described herein. 
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A preferred agent for detecting 8099, 46455, 54414, 53763, 67076, 67102, 44181 
67084FL, or 67084alf polypeptide is an antibody capable of binding to 8oW 46455 : 54414 
53763,67070, 67102, 4418.1, 67084FL, or 67084alt polypeptide, preferably an antibody ' 
with a detectable label. Antibodies can bepolyclonal, or more preferably, monoclonal. An 
> intact antibody, or a fragment thereof (e.g., Fab or F(ab-)2) can be used. The term "labeled" 
With regard to the probe or antibody, is intended to encompass direct labeling of the probe or 
antibody by coupling (i.e., physically ^ linking) a detectable substance to the probe or 
antibody, as well as indirect labeling of the probe or antibody by reactivity with another' 
reagent that is directly labeled. Examples of indirect labeling include detection of a primary 
anybody using a fluor. scently labeled secondary antibody and end-labeling of a DNA probe 
With biotm such that it can be detected with fluorescently labeled streptavidin The term 
"biological sample" is intended to include tissues, cells and biological fluids isolated from a 
subject, as well as tissues, cells and fluids present within a subject. That is, the detection 
method of the invention can be used to detect'8099, 46455, 54414, 53763 67076 67102 
44181, 67084FL, or 67084alt mRNA, polypeptide, or genomic DNA in a'biological sample 
/« vi/roaswellasmv/vo. For example, in vitro techniques for'detection of 8099 46455 
54414, 53763, 67076, 67102, 44181, 67 P 84FL, or 67084alt mRNA include Northern ' 
hybridizations and in situ hybridizations. In vitro techniques for detection of 8099 46455 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide include enzyme ' 
hnked immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. In vitro techniques for detection of 8099, 46455 54414 53763 
67076, 67102, 44181, 67084FL, or 67084alt genomic DNA include Southern' 
hybridizations. Furthermore, in vivo techniques for detection of PLTR polypeptide include 
introducing into a subject a labeled 8099, 46455, 54414, 53763, 67076, 67102 44181 
67084FL, or 67084alt antibody. For.example, the antibody can be labeled with a 
radioactive marker whose presence and location in a subject can be detected by standard 
imaging techniques. 

The present invention also provides diagnostic assays for identifying the presence or 
absence of a genetic alteration characterized by at least one of (i) aberrant modification or 
mutation of a gene encoding a 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL 
or67084alt polypeptide; (ii) aberrant expression of a gene encoding a 8099 46455 54414 ' 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide; (iii) mis-regulation of the 
gene; and (in) aberrant post-translational modification of a 8099 46455 54414 53763 
67076, 67102, 44181, 67084FL, or. 67084* polypeptide, wherein a wiid-type form of the 
gene encodes a polypeptide with a 8099, 46455, 54414, 53763, 67076, 67102 44181 
67084FL, or67084alt activity. "Misexpression or aberrant expression", as used herein 
refers to a non-wild type pattern of gene expression, at the RNA or protein level It 
includes, but is not limited to, expression at non-wild type levels (e.g. , over or under 
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expression); a pattern of expression that differs from wild type in terms of the time or stage 
••■ at which the gene is expressed (e.g. ,. increased or decreased expression (as compared with . 
wild type) at a predetermined developmental period or stage); apatterh of expression that 
differ^ from wild type in terms of decreased expression (as compared with wild type) in a 
5 predetermined cell type or tissue type; a pattern of expression that differs from wild type in 
terms of the splicing size, amino acid sequence, post-transitional modification, or biological 
activity of the expressed polypeptide; a pattern of expression that differs from wild type in 
terms of the effect of an environmental stimulus or extracellular stimulus on expression of 
the gene (e.g. , a pattern of increased or decreased expression (as compared with- wild type) 
10 in the presence of an increase or decrease in the strength of the stimulus). 

. In. one embodiment, the biological sample contains protein molecules from the test 

subject. Alternatively, the biological sample can contain mRNA molecules from the test 
subject or genomic DNA molecules from the test subject. A preferred biological sample is a 
serum sample isolated by conventional means from a subject. 
1 5 In another embodiment, the methods further involve obtaining a control biological 

sample from a control subject, contacting the control sample with a compound or agent 
capable of detecting 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide, mRNA, or genomic DNA, such that the presence of 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide, mRNA or genomic 
20 DNA is detected in the biological sample, and comparing the presence of 8099, 46455, 

54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide, mRNA or genomic 
DNA in the control sample with the presence of 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt polypeptide, mRNA or genomic DNA in the test sample. 
The invention also encompasses kits for detecting the presence of 8099, 46455, 
25 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt in a biological sample. For 
example, the kit can comprise a labeled compound or agent capable of detecting 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or mRNA 
in a biological sample; means for determining the amount of 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt in the sample; and means fo,r comparing the 
30 amount of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt in the 
sample with a standard. The compound or agent can be packaged in a suitable container. 
' The kit can further comprise instructions for using the kit to detect 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or nucleic acid. 

35 2. Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify 
subjects having or at risk of developing a disease or disorder associated with aberrant or 
unwanted 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
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expression or activity. As used herein, the term "aberrant" includes a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, . or 67084alt expression or activity which deviates ■ 
from the wild type 8099, 46455, 54414, 53763, 67076" 67102, 44181, 67084FL, or 67084alt 
expression or activity. Aberrant expression or activity includes increased or decreased 
5 expression or activity, as well as expression or activity which does not follow the wild type 
developmental pattern of expression or the subcellular pattern of expression. For example, 
aberrant 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt 
expression or activity is intended to include the cSases in which a mutation in the 8099, 
46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt gene causes the 8099, 

10 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene to be under- 

expressed or over-expressed and, situations in which such mutations result in a non- 1 
functional 8099; 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide or a polypeptide which does not function in a wild-type fashion, e.g., a protein 
_ which does not interact with or transport a 8099, 46455, 54414, 53763, 67076, 67102, 

15 44181, 67084FL, or 67084alt substrate, or one which interacts with or transports a non- 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt substrate. As 
used herein, the term "unwanted" includes an unwanted phenomenon involved in a 
biological response such as deregulated cellular proliferation. For example, the term 
unwanted includes a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

20 67084alt expression or activity which is undesirable in a subject. 

The assays described herein, such as the preceding diagnostic assays or the following 
assays, can be utilized to identify a subject having or at risk of developing a disorder 
associated with a misregulation in 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt polypeptide activity or nucleic acid expression, such as a as a cell 

25 growth, proliferation and/or differentiation disorder. Alternatively, the prognostic assays 
can be utilized to identify a subject having or at risk for developing a disorder associated 
• with a misregulation in 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide activity or nucleic acid expression, such as a cell growth, proliferation 
and/or differentiation disorder, a sugar trnasporter associated disorder, or a potassium 

30 channel associated disorder, as described herein. Thus, the present invention provides a 
method for identifying a disease or disorder associated with aberrant or unwanted 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt expression or activity in 
which a test sample is obtained from a subject and 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide or nucleic acid {e.g., mRNA or genomic 

35 DNA) is detected, wherein the presence of 8099, 46455, 544 1 4, 53763, 67076, 67 1 02, 

441 81, 67084FL, or 67084alt polypeptide or nucleic acid is diagnostic for a subject having 
or at risk of developing a disease or disorder associated with aberrant or unwanted 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt expression or activity. 
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As used herein, a "test sample 11 refers to a biological sample obtained from a subjectof 
interest. For exampte,*a test sample can be a biological fluid (e.g. , serum)', cell sample, or 
■ tissue. . .. • i ■ • " , r 

Furthermore, the prognostic assays described herein can be used to determine 

5 whether a subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, 
protein, peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or 
disorder associated with aberrant or unwanted 8099, 46455, 54414, 53763, 67076, 67102, 
44 181, 67084FL, or 67084alt i expressidn or activity.' For example, such methods can be 
used to determine whether a subject can be effectively treated with an agent for a sugar 

JO transporter J associated disorder, a potassium channel associated disorder, or phospholipid 
transporter-associated disorder. Thus, the present invention provides methods for 
determining whether a subject can be effectively treated with an agent for a disorder 
associated with aberrant or unwanted 8099, 46455; 54414, 53763, 67076,67102, 44181, 
67084FL, or 67084alt expression or activity in which a test sample is obtained and 8099, 

15 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or nucleic 
acid expression or activity is detected (e.g., wherein the abundance of 8099,'46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or nucleic acid expression 
or activity is diagnostic for a subject that can be administered the agent to treat a disorder 
associated with aberrant or unwanted 8099, 46455, 54414, 53763, 67076, 67102, 44181,' 

20 67084FL, or 67084alt expression or activity). 

The methods of the invention can also be used to detect genetic alterations in a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene, thereby 
determining if a subject with the altered gene is at risk for a disorder characterized by 
misregulation in 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt 

25 polypeptide activity or nucleic acid expression, such as a cell growth, proliferation and/or 
differentiation disorder. In preferred embodiments, the methods include detecting, in a 
sample of cells from the subject, the presence or absence of a genetic alteration characterized 
by at least one of an alteration affecting the integrity of a gene encoding a 8099, 46455;, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt -polypeptide, or the mis- 

30 expression of the 8099, 46455, 54414, 53763, 67076, 67102, 44 1 8 1 , 67084FL, or 67084alt 
gene. For example, such genetic alterations can be detected by ascertaining the existence of 
at least one of 1) a deletion of one or more nucleotides from a 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt gene; 2) an addition of one or more 
nucleotides to a 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt 

35 gene; 3) a substitution of one or more nucleotides of a 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt gene, 4) a chromosomal rearrangement of a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene; 5) an alteration in 
the level of a messenger RNA transcript of a 8099, 46455, 54414, 53763, 67076, 67102, 
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44181, 67084FL, or 670.84alt gene, 6) aberrant modification of a 8099, 46455, 54414, 
53763, 67076, 67102/44181, 67084FL, or 67084alt gene, such as of the methylation 
pattern of the genomic DNA, 7) the presence of a non-wild type splicing pattern of a 
messenger RNA transcript of a 8099, 4645$, 54414, 53763, 67076, 67102, 44181, 67084FL, 
5 or 67084alt gene; 8) a non-wild type level of a 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt -polypeptide, 9) allelic loss of a 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67Q84FL, or 67084alt gene, and 10) inappropriate post-translational 
modification of a 8099, 46455', 54414, 5*3763, 67076', 67102, 44181,' 67084FL, or 67084alt - 
polypeptide. As described herein, there are a large number of assays, known in the art which 

10 can be used'for detecting alterations in a 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 
67084FL, or 67084alt gene. A preferred biological sample is a tissue or serum sample 
isolated by conventional means from a subject. 

In certain embodiments, detection of the alteration involves the use of a probe/primer 
in a polymerase chain reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 4,683,202), 

15 such as anchor PCR or RACE PCR, or, alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegranera/. (1988) Science 241:1077-1080; and Nakazawa etal. (1994) Proc. 
Natl. Acad. Sci: USA 91:360-364), the latter of which can be particularly useful for detecting 
point mutations in the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt gene (see Abravaya etal. (1995) Nucleic Acids Res. 23:675-682), This method 

20 can include the steps of collecting a sample of cells from a subject, isolating nucleic acid 
(e.g., genomic, mRNA or both) from the cells of the sample, contacting the nucleic acid 
sample with one or more primers which specifically hybridize to a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt gene under conditions such that 
hybridization and amplification of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

25 67084FL, or 67084alt gene (if present) occurs, and detecting the presence or absence of an 
amplification product, or detecting the size of the amplification product and comparing the 
- length to a control sample. It is anticipated that PCR and/or LCR may be desirable to use as 
a preliminary amplification step in conjuncticm with any of the techniques used for detecting 
mutations described herein. 

30 Alternative amplification methods include: self sustained sequence replication 

(Guatelli, J.C. etal, (1990) Proa Natl Acad. Set USA 87:1874-1878), transcriptional 
amplification system (Kwoh, D.Y. et al, (1989) Proc. Natl Acad Scl USA 86: 1 173-1 177), 
Q-Beta Replicase (Lizardi, P.M. et al (1988) Bio-Technology 6:1197), or any other nucleic 
acid amplification method, followed by the detection of the amplified molecules using 

35 techniques well known to those of skill in the art. These detection schemes are especially 
useful for the detection of nucleic acid molecules if such molecules are present in very low 
numbers. 
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In an alternative embodiment, mutations in a 8099, 46455, .5441 4, 53763, 67076, 

.„ 67102, 44181, 67084FL, or 67084alt gene from a Sample cell can be identified by 

alterations in restriction enzyme cleayage patterns, ,For example, sample and control DNA 
is isolated, amplified (optionally), digested with one or more restriction endonucleases,' and 
5 fragment length sizes are determined by gel electrophoresis and compared. Differences in 
fragment length sizes between sample and control DNA indicates mutations in the sample 
DNA. Moreover, the use of sequence specific .ribozymes (see, fipr example, U.S. Patent No. 
5,498,531) can be used to score for the presence pf specific mutations by development or 
loss of a ribozyme cleavage site. 

10 In other embodiments,, genetic mutations in 8099, 46455, 54414, 53763, 67076, 

, 67102, 44181, 67084FL, or 67084alt can be identified by hybridizing a sample and control 
nucleic acids, e.g. , DNA or RNA, to high density arrays containing hundreds or thousands 
of oligonucleotides probes (Cronin,M.T. et at (1996) Human Mutation .7; 244-255; Kozal, 
' M.J. et al (1996) Nature Medicine 2: 753-759). For example, genetic mutations in 8099, 

15 46455, 54414, 53763, 67076, 67102, 441 81, 67084FL, or 67084alt can be identified in two 
dimensional arrays containing light-generated DNA probes as described in Cronin, M.T. et 
al supra. Briefly, a first hybridization array of probes can be used to scan through long 
stretches of DNA in a sample and control to identify base changes between the sequences by 
making linear arrays of sequential overlapping probes. This step allows the identification of 

20 point mutations. This step is followed by a second hybridization array that allows the 
characterization of specific mutations by using smaller, specialized probe arrays 
complementary to all variants or mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild-type gene and the other complementary 
to the mutant gene. 

25 In yet another embodiment, any of a variety of sequencing reactions known in the art 

can be used to directly sequence the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
t 67084FL, or 67084alt gene and detect mutations by comparing the sequence of the sample 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt with the 
corresponding wild-type (control) sequence. Examples of sequencing reactions include 

30 those based on techniques developed by Maxam and Gilbert ((1977) Proc. Natl Acad. Sci. 
USA 74:560) or Sanger ((1977) Proc. Natl Acad. Sci. USA 74:5463). It is also 
' contemplated that any of a variety of automated sequencing procedures can be utilized when 
performing the diagnostic assays ((1995) Biotechniqites 19:448), including sequencing by 
mass spectrometry (see, e.g. , PCT International Publication No. WO 94/1 6 101 ; Cohen et al 

35 (1996) Adv. Chromatogr. 36; 127-162; and Griffin et al (1993) Appl Biochem. Biotechnol 
38:147-159). 

Other methods for detecting mutations in the 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt gene include methods in which protection from 
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cleavage agents is used to detect mismatched bases in RNA/RNA or RNA/DNA 
heteroduplexes (Myers et al (1 985) Science 230:1242). In general, the art technique of 
"mismatch cleavage" starts by providing heteroduplexes of formed by hybridizing (labeled) 
RNA or DNA containing the wild-type 8099, 46455, 54414; 53763, 67076, 67102, 44181, 
5 67084FL, or 67084alt sequence with potentially mutant RNA or DNA obtained from a 

tissue sample. The double-stranded duplexes are treated with an agent which cleaves single- 
stranded regions of the duplex such as which will e^ist due to basepair mismatches between 
the control and sample strands. For instance, RNA/DNA duplexes can be treated with I 
RNase and DNA/DNA hybrids treated with S 1 nuclease to enzymatically digesting the 

10 mismatched regions. In other embodiments, either DNA/DNA or RNA/DNA duplexes can 
be treated with hydroxylamine or osmium tetroxide and with piperidine in order to digest 
mismatched regions. After digestion of the mismatched regions, the resulting material is 
then separated by size on denaturing polyacrylamide gels to determine the site of mutation. 
_ See, for example, Cotton et al. (1988) Proc. Natl Acad Sci USA 85:4397; Saleeba et al 

15 (1992) Methods Enzymol 21,7:286-295. In a preferred embodiment, the control DNA or- 
RNA can be lajbeled for detection. 1 

In still another embodiment, the .mismatch cleavage reaction employs one or more 
proteins that recognize mismatched base pairs in double-stranded DNA (so called "DNA 
mismatch repair" enzymes) in defined systems for detecting and mapping point mutations in 

20 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt .cDNAs obtained 
from samples of cells. For example, the mutY enzyme of E. coli cleaves A at G/A 
mismatches and the thymidine DNA glycosylase from HeLa cells cleaves T at G/T 
mismatches (Hsu et al (1994) Carcinogenesis 15:1657-1662). According to an exemplary 
embodiment, a probe based on a 8099, 46455; 54414, 53,763, 67076, 67102, 44181, 

25 67084FL, or 67084alt sequence, e.g.,, a wild-type 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt sequence, is hybridized to a cDNA or other DNA 
product from a test cell(s). The duplex is treated with a DNA mismatch repair enzyme, and 
the cleavage products, if any, can be detected from electrophoresis protocols or the like. 
See, for example, U.S. Patent No. 5,459,039. 

30 In other embodiments, alterations in electrophoretic mobility will be used to identify 

mutations in 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
genes. For example, single strand conformation polymorphism (SSCP) may be used to 
detect differences in electrophoretic mobility between mutant and wild type nucleic acids 
(Orita et al (1989) Proc Natl Acad, Sci USA: 86:2766, see also Cotton (1993) Mutat. Res. 

35 285:125-144; and Hayashi (1992) Genet Anal Tech. Appl 9:73-79). Single-stranded DNA 
fragments of sample and control 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt nucleic acids will be denatured and allowed to renature. The 
secondary structure of single-stranded nucleic acids varies according to sequence, the . 
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resulting alteration in electrophoretic mobility enables the detection of even a single base 
change. The DNA fragments may be labeled or detected with labeled probes. The 
sensitivity of the assay may be enhanted by using'RNA (rather than DNA), in which the 
secondary structure is more sensitive to a change in sequence. In a preferred embodiment, 
5 the subject method utilizes heteroduplex analysis to separate double stranded heteroduplex 
molecules on the basis of changes in electrophoretic mobility (Keen et al (1 991) Trends 
Genet 7:5). ' ' i 

In yet another embodiment the movement of mutant or wild-type fragments in 
polyacrylamide gels containing a gradient of denaturant is assayed using denaturing gradient 

10 gel electrophoresis (DGGE) (Myers et al (1 985) Nature 3 13 :495). When DGGE is used as 
1 {he method of analysis, DNA will be modified to insure that it does not completely denature, 
for example by adding a GC clamp of approximately 40 bp of high-melting GC-rich DNA 
by PCR. In a further embodiment, a temperature gradient is used in place of a denaturing 
gradient to identify differences in the mobility of control and sample DNA (Rosenbaum and 

15 Reissner (1987) BiophysChem 265: 12753). , 

Examples of other techniques for detecting point mutations include, but are not 
limited to, selective oligonucleotide hybridization, selective amplification, or selective 
primer extension. For example, oligonucleotide primers may be prepared in which the 
known mutation is placed centrally and then hybridized to target DNA under conditions 

20 which permit hybridization only if a perfect match is found (Saiki et al (1 986) Nature 
324:163); Saiki et al (1989) Proc. Natl Acad Sci USA 86:6230). Such allele specific 
oligonucleotides are hybridized to PCR amplified target DNA or a number of different 
mutations when the oligonucleotides are attached to the hybridizing membrane and 
hybridized with labeled target DNA. 

25 Alternatively, allele specific amplification technology which depends on selective 

PCR amplification may be used in conjunction with the; instant invention. Oligonucleotides 
* used as primers for specific amplification may carry the mutation of interest in the center of 
the molecule (so that amplification depends on differential hybridization) (Gibbs et al (1 989) 
Nucleic Acids Res. 17:2437-2448) or at the extreme 3' end of one primer where, under 

30 appropriate conditions, mismatch can prevent, or reduce polymerase extension (Prossner 

(1993) Tibtech 1 1:23 8). In addition it may be desirable to introduce a novel restriction site in 
the region of the mutation to create cleavage-based detection (Gasparini et al (1992) M?/, 
Cell Probes 6: 1). It is anticipated that in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany (1991) Proc. Natl Acad Sci USA 

35 88:189). In such cases, ligation will occur only if there is a perfect match at the 3' end of the 
5* sequence making it possible to detect the presence of a known mutation at a specific site by 
looking for the presence or absence of amplification. 
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The methods described herein may be performed, for example, by utilizing pre- 
packaged diagnostic kits comprising at least one probe nucleic acid or antibody reagent 
described herein, which may be conveniently used^e.g:, in clinical settings to diagnose ■ 
patients exhibiting symptoms or family history of a disease or illness involving a 8099, 
5 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene. 

Furthermore, any cell type or tissue in which 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt is Expressed may be utilized in the prognostic assays 
described herein. i 1 

10 3. Monitoring of Effects During Clinical Trials 

1 ( Monitoring the influence of agents (e.g. , drugs) on the expression or activity of a * 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide (e.g., 
the modulation of gene expression, cellular signaling, 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt activity, phospholipid transporter activity, and/or cell 

15 growth, proliferation, differentiation, absorption, and/or secretion mechanisms) can be 
applied not only in basic drug screening, but also in clinical trials. For example, the 
effectiveness of an agent determined by a screening assay as described herein to increase 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 6708'4alt gene expression, 
polypeptide levels, or upregulate 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

20 67084FL, or 67084alt activity, can be monitored in clinical trials of subjects exhibiting 
decreased 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt gene 
expression, polypeptide levels, or downregulated 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt activity. Alternatively, the effectiveness of an agent 
determined by a screening assay to decrease 8099, 46455, 54414, 53763, 67076, 67102, 

25 44181, 67084FL, or 67084alt gene expression, polypeptide levels, or downregulate 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL,or 67084alt activity, can be 
monitored in clinical trials of subjects exhibiting increased 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt gene expression, polypeptide levels, or 
upregulated 8099, 46455, 54414, 53763,' 67076, 67102, 44181, 67084FL„or 67084alt 

30 activity. In such clinical trials, the expression or activity of a 8099, 46455, 54414, 53763, 
67076, 67102, 441 81, 67084FL, or 67084alt gene, and preferably, other genes that have 
been implicated in, for.example, a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt -associated disorder can be used as a "read out" or markers of the 
phenotype of a particular cell. 

35 For example, and not by way of limitation, genes, including 8099, 46455, 54414, 

53763, 67076, 67102, 44181, 67084FL, or 67084alt , that are modulated in cells by 
treatment with an agent (e.g., compound, drug or small molecule) which modulates 67076, 



114 

BNSDOCID: <WO_02055701 A2_l_> 



WO 02/055701 



PCT/US01/49060 



67102, 44181, 67084FL, or 67084alt activity {e.g., identified in a screening assay as 
described herein) can 'be identified. 

Thus, to study the effoct of agents on phospholipid transporter-associated disorders 
(e.g., disorders characterized by deregulated gene expression, cellular signaling, 67076, 

5 67102, 44181, 67084FL, or 67084alt activity, phospholipid transporter activity, and/or cell 
growth, proliferation, differentiation, absorption, and/or secretion mechanisms), for 
example, in a clinical trial, cells can be isolated and RNA prepared and analyzed for the 
levels of expression of 67076,»67102, 44181, 67084FL, or 67084alt and other genes' ; 
implicated in the .transporter-associated disorder, respectively. The levels of gene 

10 expression (e.g. , a gene expression pattern) can be quantified by* northern blot analysis or 
RT-PCR, as described herein, or alternatively by measuring the amount of polypeptide 
produced, by one of the methods as described herein, or by measuring the levels of activity 
of 67076, 67102, 44181, 67084FL, or 67084alt or other genes. In this way, the gene 
expression pattern can serve as a marker, indicative of the physiological response of the cells 

15 to the agent. Accordingly, this response state may be determined before, and at various 
points during treatment of the individual with the agent. 1 

In a preferred embodiment, the present invention provides a method for monitoring 
the effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug candidate 

20 identified by the screening assays described herein) including the steps of (i) obtaining a 
pre-administration sample from a subject prior to administration of the agent; (ii) detecting 
the level of expression of a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt polypeptide, mRNA, or genomic DNA in the preadministration sample; (iii) 
obtaining one or more post-administration samples from the subject; (iv) detecting the level 

25 of expression or activity of the 8099, 46455, 54414, 53763,' 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide, mRNA, or genomic DNA in $he post-administration samples; (v) 
comparing the level of expression or activity of the 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt polypeptide, mRNA, or genomic DNA in the pre- 
administration sample with the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 

30 or 67084alt polypeptide, mRNA, or genomic DNA in the post administration sample or 
samples; and (vi) altering the administration of the agent to the subject accordingly. For 
example, increased administration of the agent may be desirable to increase the expression 
or activity of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt to 
higher levels than detected, i.e., to increase the effectiveness of the agent. Alternatively, 

35 decreased administration of the agent may be desirable to decrease expression or activity of 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt to lower levels 
than detected, i.e. to decrease the effectiveness of the agent. According to such an 
embodiment, 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
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expression' or activity may be used as an indicator of the effectiveness of an agent, even in 
the absence of an obsdiVable phenotypic response. 

D. » Methods of Treatment: 
5 The present invention provides for both prophylactic and therapeutic methods of 

treating a subject at risk of (or susceptible to) a disorder or having a disorder associated with 
aberrant or unwanted 8099, 46455, 54414, 53763, 67076- 67102, 44181, 67084FL, or 
67084ait expression or activity, e.g. a phospholipid <!ransporter-asso&ated disorder. ' ; 
"Treatment", as used herein, is defined as the application or administration of a therapeutic 

10 agent to a p&tient, or application or administration of a therapeutic agent to an isolated tissue 
or cell line from a p'atient, who has a disease or disorder, a symptom of disease or disorder 
or a predisposition toward a disease or disorder, with the purpose to cure, heal, alleviate, 
relieve, alter, remedy, ameliorate, improve or affect the disease or disorder, the symptoms of 
disease or disorder or the predisposition toward a disease or disorder. A therapeutic agent 

15 includes, but is not limited to ? small molecules, peptides, antibodies, ribozymes and 
antisense oligonucleotides. * 1 

With regards to both prophylactic and therapeutic methods of treatment, such 
treatments may be specifically tailored or modified, based on knowledge obtained from the 
field of pharmacogenomics. "Pharmacogenomics", as used herein, refers to the application 

20 of genomics technologies such as gene sequencing, statistical genetics, and gene expression 
analysis to drugs in clinical development and on the market. More specifically, the term 
refers the study of how a patient's genes determine his or her response to a drug (e.g., a 
patient's "drug response phenotype", or "drug response genotype"). Thus, another aspect of 
the invention provides methods for tailoring an individual's prophylactic or therapeutic 

25 treatment with either the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt molecules of the present invention or 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt modulators according to that individual's drug response 
genotype. Pharmacogenomics allows a clinician or physician to target prophylactic or 
therapeutic treatments to patients who will most benefit from the treatment and to avoid 

30 treatment of patients who will experience toxic drug-related side effects. 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a disease 
or condition associated with an aberrant or unwanted 8099, 46455, 54414, 53763, 67076, 
35 67 1 02, 44 1 8 1 , 67084FL, or 67084alt expression or activity, by administering to the subject 
a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt or an agent 
which modulates 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
expression or at least one 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
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67084alt activity. Subjects at risk for a disease which is caused or contributed to by 
•"■ aberrant or unwanted 8099, 46455, 544 H, 53763, 67076,. 67 102, 44181, 67084FL, or 

67084alt expression or activity can be identified by, for example, any or a combination 6f 
diagnostic or prognostic assays as described herein. Administration of a prophylactic agent 
5 can occur prior to the manifestation of symptoms characteristic of the 8099, 46455, 544 14, 
53763, 67076, 67102, 441 81, 67084FL, or 67084alt aberrancy, such that a disease or 
disorder is prevented or, alternatively, delayed in its progression. Depending on the. type of 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL 1 , or 67084alt aberrancy, for 
example, a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt , 8099, 
10 46455, 54414, 53763, 67076, 167102, 44181, 67084FL, or 67084alt agonist or 8099, 46455, 
' £4414, 53763, 67076, 67102, 44,181, 67084FL, or 67084alt antagonist agent can be used for 
treating the subject. The appropriate agent can be determined based on screening assays 
described herein. 

15 2. Therapeutic Methods , , 

Another aspect of the invention pertains to methods of modulating 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt expression or activity for 
therapeutic purposes. Accordingly, in an exemplary embodiment, the modulatory method of 
the invention involves contacting a cell capable of expressing 8099, 46455, 54414, 53763, 

20 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt with an agent that modulates one or more of 
the activities of 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
polypeptide activity associated with the cell, such that 8099, 46455, 54414, 53763, 67076, 
67 1 02, 44 1 8 1 , 67084FL, or 67084alt activity in the cell is modulated. An agent that 
modulates 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

25 polypeptide activity can be an agent as described herein, such as a nucleic acid or a 

polypeptide, a naturally-occurring target molecule of a 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt polypeptide (e.g., a 8099, 46455, 54414, 53763, 
67076, 67102, 441 81, 67084FL, or 67084alt substrate), a 8099, 46455, 54414, 53763, 
67076, 61 1 02, 44 1 8 1 , 67084FL, or 67084alt antibody, a 8099, 46455, 544 1 4, 53763, 

30 67076, 67102, 441 81, 67084FL, or 67084alt agonist or antagonist, a peptidomimetic of a 
8099, 46455, 5441 4, 53763, 67076, 671 02, 441 81, 67084FL, or 67084alt agonist or 
antagonist, or other small molecule. In one embodiment, the agent stimulates one or more 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt activities. 
Examples of such stimulatory agents include active 8099, 46455, 54414, 53763, 67076, 

35 67 1 02, 44 1 8 1 , 67084FL, or 67084alt polypeptide and a nucleic acid molecule encoding 
8099, 46455, 5441 4, 53763, 67076, 671 02, 441 81, 67084FL, or 67084alt that has been 
introduced into the cell. In another embodiment, the agent inhibits one or more 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt activities. Examples of 
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such inhibitory agents include antisense 8099, 46455, 54414, 53763, 67076, 671 02, 441 81; 
67084FL, or 67084alt nucleic acid molecules, anti-8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alf 3 antibodies, 'and 8099, 46455, 54414, 53763, 67076, 
67102', 44181, 67084FL, or 67084alt inhibitors. These modulatory methods can be 
5 performed in vitro (e.g., by culturing the cell with the'agent) or, alternatively, in vivo (e.g., 
by administering the agent to a subject). As such, the present invention provides methods Of 
treating an individual affljcted with a disease or disorder characterized by aberrant or 
unwanted expression Or activity of a 8099, 46455, 54414, 53763, 67076, 67i02, 44181, 
67084FL, or 67084alt polypeptide or nucleic acid molecule. In one embodiment, the 

10 method involves administering an agent (e.g., an agent identified by a screening assay 
1 described herein), or combination of agents that modulates (e.g. , upregulates or 1 
downregulates) 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
expression or activity. In another embodiment, the method involves administering a 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or nucleic 

15 acid molecule as therapy to compensate for reduced, aberrant, or unwanted 8099, 4645 5 , 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt expression or activity. 

Stimulation of 8099, 46455, 54414, 53763, 67076„67102, 44181, 67084FL, or 
67084alt activity is desirable in situations in which 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt is abnormally dOwnregulated and/or in which 

20 increased 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt activity 
is likely to have a beneficial effect. Likewise, inhibition of 8099, 46455, 54414, 53763, 
67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt activity is desirable in situations in which 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt is abnormally 
upregulated and/or in which decreased 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

25 67084FL, or 67084alt activity is likely to have a beneficial effect. 

3. Pharmacogenomics 

The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
molecules of the present invention, as well as agents, or modulators which have a 

30 stimulatory or inhibitory effect on 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt activity (e.g., 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt gene expression) as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically), for example, 
disorders characterized by aberrant 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

35 67084FL, or 67084alt gene expression, 8099, 46455, 54414, 53763, 67076, 67102, 441 81, 
67084FL, Or 67084alt activity, membrane excitability or conductance, gene transcription, 
phospholipid transporter activity, cellular signaling, and/or cell growth, proliferation, 
differentiation, absorption, and/Or secretion disorders associated with aberrant or unwanted 
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8099,46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt activity. In 
conjunction with such treatment, pharmacogenomics (/Ye., the study of thd relationship 
between an individual's genotype and that individual's response to a foreign compound or 
drug) may be considered. Differences in itfetabolism of therapeutics can lead to Severe 
5 toxicity or therapeutic failure by altering the relation between dose and blood concentration 
of the phaimacologically active drug. Thus, a physician or clinician may consider applying 
knowledge obtained in relevant pharmacogenomics studies in determining whether to 
administer a 8099, 46455, ? 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084'alt . 
molecule or 8099,, 46455, 544i4, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

10 modulator as well as tailoring the dosage and/or therapeutic regimen of treatment with a 
8099, 46455, 54414, $3763; 67076, 67102, 44181, 67084FL, or 67084alt molecule or 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected persons. 

15 See, for example, Eichelbaum, }A. et aL (1996) Clin. Exp. Pharmacol. Physiol. 23(10-1 1): 
983-985 and Under, M.W. etal. (1997) Clin. Chem. 43(2):254-266. In general, two types 
of pharmacogenetic conditions can be differentiated. Genetic conditions transmitted as a 
single factor altering the way drugs act on the body (altered drug action) or genetic 
conditions transmitted as single factors jaltering the way the body acts on drugs (altered drug 

20 metabolism). These pharmacogenetic conditions can occur either as rare genetic defects or 
as naturally-occurring polymorphisms. For example, glufcose-6-phosphate dehydrogenase 
deficiency (G6PD) is a common inherited enzymopathy in which the main clinical 
complication is haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, 
analgesics, nitrofurans) and consumption of fava beans. 

25 One pharmacogenomics approach to identifying genes that predict drug response, 

known as "a genome-wide association", relies primarily on a high-resolution map of the 
human genome consisting of already known gene-related markers (e.g., a. M bi-allelic n gene 
marker map which consists of 60,000-1 00,000 polymorphic or variable sites on the human 
genome, each of which has two variants.) Such a high-resolution genetic map can be 

30 compared to a map of the genome of each of a statistically significant number of patients 
taking part in a Phase II/IH drug trial to identify markers associated with a particular 
observed drug response or side effect. Alternatively, such a high resolution map can be 
generated from a qombination of some ten-million known single nucleotide polymorphisms 
(SNPs) in the human genome. As used herein, a "SNP" is a common alteration that occurs 

35 in a single nucleotide base in a stretch of DNA. For example, a SNP may occur once per 
every 1 000 bases of DNA. A SNP may be involved in a disease process, however, the vast 
majority may not be disease-associated. Given a genetic map based on the occurrence of 
such SNPs, individuals can be grouped into genetic categories depending on a particular 
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pattern of SNPs in their, individual genome. In such a manner, treatment regimens can be 
tailored to groups of genetically similar individuals, taking into account traits that may be . 
common among such genetically similar individuals. \ . 

Alternatively, a method termed the "candidate gene approach", can be utilized to 
5 identify genes that predict drug response. According to this method, if a gene that encodes a 
drugs target is known (e.g., a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt polypeptide of the present invention), all common variants of that gene can be 
fairly easily identified in the population and it can be determined if having one version of 
the gene versus another is associated with a particular drug response. 

10 As an illustrative emb6diment, the activity of drug metabolizing enzymes is a major 

' determinant of both the intensity and duration of drug action. The discovery of genetic 1 
polymorphisms of drug metabolizing enzymes (e. g. , N-acety transferase 2 (NAT 2) and 
cytochrome P450 enzymes CYP2D6 and CYP2C 1 9) has provided an explanation as to why 
some patients do not obtain the expected drug effects or show exaggerated drug response 

15 and serious toxicity after taking the standard and safe dose of a drug. These polymorphisms 
are expressed in two phenotypes in the population, the extensive metabolizer (EM) and poor 
metabolizer (PM). The prevalence of PM is different among different populations. For 
example, the gene coding for CYP2D6 is highly polymorphic and several mutations have 
been identified in PM, which all lead to the absence of functional CYP2D6. Poor 

20 metabolizers of CYP2D6 and CYP2C 1 9 quite frequently experience exaggerated drug 
response and side effects when they receive standard doses. If a metabolite is the active 
therapeutic moiety, PM show no therapeutic response, as demonstrated for the analgesic 
effect of codeine mediated by its CYP2D6-formed metabolite morphine. The other extreme 
are the so called ultra-rapid metabolizers who do not respond to standard doses. Recently, 

25 the molecular basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 
amplification. 

Alternatively, a method termed the n gene expression profiling", can be utilized to 
identify genes that predict drug response. For example, the gene expression of an animal 
dosed with a drug (e.g., a 8099, 46455, 54414, 53763, 67076, 67102, 441,81, 67084FL, or 

30 67084alt molecule or 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 

67084alt modulator of the present invention) can give an indication whether gene pathways 
related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and treatment regimens for • 

35 prophylactic or therapeutic treatment an individual. This knowledge, when applied to 

dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus enhance 
therapeutic or prophylactic efficiency when treating a subject with a 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt molecule or 8099, 46455, 54414, 
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53763, 67076, 67102, 44181, 67084FL, or67084alt modulator, such as a modulator 
identified by one of the exemplary screening assays described herein. 

4. . • Use of 8099, 46455, 54414,' 53763, 67076, 67102, 44181, 67084FL, or 

5 67084alt Molecules as Surrogate Markers 

The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
molecules of the invention, are also useful as markers of disorders or disease states, as 
markers for precursors of disedse states, as markers for predisposition of disease states, as 
markers of drug activity, or as markers of the pharmacogenomic profile of a subject. Using 

10 the method^ described herein, th^ presence, absence and/or quantity of the 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt molecules of the invention may 
be detected, and may be correlated with one or more biological states in vivo. For example, 
the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, pr 67084alt molecules of 
the invention may serve as surrogate markers for one or more disorders or disease states or 

15 for conditions leading up to cjiselafce states. As used herein, a "surrogate marker" is an. 

objective biochemical marker which correlates with the absence or presence of a disease or 
disorder, or with the progression of a disease or disorder (e.g., with the presence or absence 
of a tumor). The presence or quantity of such markers is independent of the disease. 
Therefore, these markers may serve to indicate whether a particular course of treatment is 

20 effective in lessening a disease state or disorder. Surrogate markers are of .particular use 
when the presence or extent of a disease state or disorder is difficult to assess through 
standard methodologies (e.g., early stage tumors), or when an assessment of disease 
progression is desired before a potentially dangerous clinical endpoint is reached (e.g., an 

assessment of cardiovascular disease may be made using .cholesterol levels as a surrogate 

' i ... 

25 marker, and an analysis of HIV infection may be made using HIV RNA levels as a surrogate 
marker, well in advance of the undesirable clinical outcomes of myocardial infarction or 
fully-developed AIDS). Examples of the use of surrogate markers in the art include: 
Koomen et al (2000) J. Mass. Spectrom. 35: 258-264; and James (1994) AIDS Treatment 
News Archive 209. 

30 The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

molecules of the invention are also useful as pharmacodynamic markers. As used herein, a 
"pharmacodynamic marker" is an objective biochemical marker which correlates 
specifically with drug effects. The presence or quantity of a pharmacodynamic marker is 
not related to the disease state or disorder for which the drug is being administered; 

35 therefore, the presence or quantity of the marker is indicative of the presence or activity of 
the drug in a subject. For example, a pharmacodynamic marker may be indicative of the 
concentration of the drug in a biological tissue, in that the marker is either expressed or 
transcribed or not expressed or transcribed in that tissue in relationship to the level of the 
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drug. In this fashion, the distribution or uptake of the drug may be monitored by the' 
pharmacodynamic marker. Similarly, the presence or quantity of the pharmacodynamic 
marker may be related to the presence or quantity of the metabolic produrft of a drug, such 
that the presence or quantity of the marker is indicative of the relative breakdown rate of the 
5 drug in vivo. Pharmacodynamic markers are of particular use in increasing the sensitivity of 
detection of drug effects, particularly when the drug is administered in low doses. Since 
even a small amount of a drug may be sufficient to activate multiple rounds of marker (e.g., 
a 8099, 46455, 54414, 53763', 67076, 67102, 44l8l, 67084FL, or 67084alf marker)' ! 
transcription or expression, the amplified marker may be in a quantity which is more readily 
10 detectable than the drug itself. Also, the marker may be more easily detected due to the 
nature of the marker itself; for example, using the methods described herein, anti-8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt antibodies may be' 
employed in an immune-based detection system for a 8099,. 46455, 544 1 4, 53763 , 67076, 
_ 67102, 44181, 67084FL, or 67084alt polypeptide marker, or 8099, 46455, 54414, 53763, 

15 67076, 67102, 44181, 67084FD, or 67084alt -specific radiolabeled probes may be used to' 
detect a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt mRNA 
marker. Furthermore, the use of a pharmacodynamic marker may offer mechanism-based 
prediction of risk due to drug treatment beyond the range of possible direct observations. 
Examples of the use of pharmacodynamic markers in the art include: Matsuda etal. US 
20 6,033,862; Hattis et al (\99\).Env. Health Perspect. 90: 229-238; Schentag (1999) Am. J. 
Health-Syst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999) Am, J. Health-Syst. Pharm. 
56 Suppl. 3: S16-S20. 

The 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
molecules of the invention are also useful as pharmacogenomic markers. As used herein, a 
25 "pharmacogenomic marker" is an objective biochemical marker which correlates with a 
specific clinical drug response or susceptibility in a subject (see, e.g., McLeod et al. (1 999) 
Eur. J. Cancer 35(12): 1 650-1652). The presence or quantity of the pharmacogenomic 
marker is related to the predicted response of the subject to a specific drug or class of drugs 
prior to administration of the drug. By assessing the presence or quantity of one or more 
30 pharmacogenomic markers in a subject, a drug therapy which is most appropriate for the 
subject, or which is predicted to have a greater degree of success, may be selected. For 
example, based on the presence or quantity of RNA, or polypeptide (e.g. ,8099, 464 55, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide or RNA) for ' 
specific tumor markers in a subject, a drug or course of treatment may be selected that is 
35 optimized for the treatment of the specific tumor likely to be present in the subject. 

Similarly, the presence or absence of a specific sequence mutation in 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL, or 67084alt DNA may correlate 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt drug response. The use of 
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pharmacogenomic markers therefore permits the application of the most appropriate 
treatment for each subject without having to administer the therapy. 

VI. Electronic Apparatus Readable Media and Arrays ■ 
5 Electronic apparatus readable media comprising 8099, 46455, 54414, 53763, 67076, 

67 1 02, 44 1 8 1 , 67084FL, or 67084alt sequence information is also provided. As used 
herein, "8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084ait sequence 
information" refers to any nucleotide and/or amino acid sequence information particular to 
the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67p84FL, or 67084alt molecules of 
1 o the present invention, including but not limited to full-liength nucleotide and/or amino acid 
' sequences, partial nucleotide and/or amino acid sequences, polymorphic sequences 
including singlenucleotide polymorphisms (SNPs), epitope sequences, and the like. 
.Moreover, information "related to" said 8099,46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt sequence information includes detection of the presence or absence 
15 of a sequence (e.g., detection of expression of a sequence, fragment, polymorphism, etc.X 
determination of the level of a sequence (e.g. , detection.of a level of expression, for 
example, a quantative detection), detection of a reactivity to a sequence (e.g., detection of 
protein expression and/or levels, for example, using a sequence-specific antibody), and the 
like. As used herein, "electronic apparatus readable media" refers to any suitable medium 
20 for storing, holding or containing data or information that can be read and accessed directly 
by an electronic apparatus. Such media can include, but are not limited to: magnetic storage 
media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage 
media such as compact disc; electronic storage media such as RAM, ROM, EPROM, 
EEPROM and the like; general hard disks and hybrids of these categories such as 
25 magnetic/optical storage media. The medium is adapted or configured for having recorded 
thereon 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt sequence 
information of the present invention. 

As used herein, the term "electronic apparatus" is intended to include any suitable 
computing or processing apparatus or other device configured or adapted' for storing data or 
30 information. Examples of electronic apparatus suitable for use with the present invention 
include stand-alone computing apparatus; networks, including a local area network (LAN), a 
wide area network (WAN) Internet, Intranet, and Extranet; electronic appliances such as a 
personal digital assistants (PDAs), cellular phone, pager and the like; and local and 
distributed processing systems. 
35 As used herein, "recorded" refers to a process for storing or encoding information on 

the electronic apparatus readable medium. Those skilled in the art can readily adopt any of 
the presently known methods for recording information on known media to generate 
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manufactures comprising the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt sequence information. 

A variety of software programs and formats can be used to store: the sequence ' 
infonhation on the electronic apparatus readable medium. For example, the sequence 

5 information can be represented in a word processing text file, formatted in commercially-- 
available software such as WordPerfect and MicroSoft Word, or represented in the form of 
an ASCII file, stored in a database applitation; such as DB2, Sybase, Oracle, or the like, as 
well as in other forms. Any number of dataprbcessor structuHng formats (e.g., text file or 
database) may be employed in order to obtain or create a medium having recorded thereon 

10 the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL; or 67084alt sequence 

1 information. i 

i ■ ...» 

By providing 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt sequence information in readable form, one can routinely access the sequence 
information for a variety of purposes. For example, one skilled in the art can use the 

1 5 sequence information in readable form to compare a target sequence or target structural 

motif with the sequence information stored within the data storage means. Search means are 
used to identify fragments or regions of the sequences of the invention which match a 
particular target sequence or target motif. 

The present invention therefore provides a medium for holding instructions for 

20 performing a method for determining whether a subject has a 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt -associated disease or disorder or a pre- 
disposition to a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt - 
associated disease or disorder, wherein the method comprises the steps of determining 8099, 
46455, 54414, 53763, 670.76, 67102, 44181, 67084FL, or 67084alt sequence information 

25 associated with the subject and based on the 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt sequence information, determining whether the subject has a 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt -associated 
disease or disorder or a pre-disposition to a 8099, 46455, 54414, 53763,67076,67102, 
44 1 8 1 , 67084FL, or 67084alt -associated disease or disorder and/or recojnmending a 

30 particular treatment for the disease, disorder or pre-disease condition. 

The present invention further provides in an electronic system and/or in a network* a 
method for determining whether a subject has a 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt -associated disease or disorder or a pre disposition to a disease 
associated with a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 

35 wherein the method comprises the steps of determining 8099, 46455, 54414, 53763, 67076, 
67102, 44181, 67084FL, or 67084alt sequence information associated with the subject, and 
based on the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
sequence information, determining whether the subject has a 8099, 46455, 54414, 53763, 
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67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt -associated disease or disorder or a pre- 
disposition to a 8099, '46455, 54414, 53763,' 67076, 67102, 44181, 67084ft,, or 67084a!t - 
associated disease or disorder,* and/or recommending a particular treatment for the disease, 
disorder or pre-disease condition. The metfiod may further Comprise the step of receiving 
5 phenotypic information associated with the subject and/or acquiring from a network 
phenotypic information associated with the subject . • 

The present invention also provides in a network,' a method for determihihg whether 
a subject has a 8099, 46455, 54414, 537*63, 67076, 67102, 44181, 67084FL, or 6708'4alt - 
associated disease or disorder or a pre-disposition to a 8099, 46455, 544 14, 53763, 67076, 
10 67102, 44181, 67084FL, or 67084alt -associated disease or disorder associated with 8099, 
46455, 54414, 53763,' 67076, 67102, 44181, 67084FL, or 67084alt , said method comprising 
the steps 'of receiving 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or , 
67084alt sequence information from the subject arid/or information related thereto, 
receiving phenotypic information associated with the subject, acquiring information from 
15 the network corresponding to 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt and/or a 8099, 46455, 54414, 53763,- 67076- 67102, 44181, 67084FL, or 
67084alt -associated disease or disorder, and based on one or more of the phenotypic 
information, the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084ait 
information (e.g., sequence information and/or information related thereto), and the acquired 
20 information, determining whether the subject has a 8099, 46455, 54414, 53763, 67076, 

67 1 02, 44 1 8 1 , 67084FL, or 67084alt -associated disease or disorder or a pre-disposition to a 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 670 84FL, or 67084alt -associated disease 
or disorder. The method may further comprise the step of recommending a particular 
treatment for the disease, disorder or pre-disease condition. 
25 The present invention also prQvides a business method for determining whether a 

subject has a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt - 
associated disease or disorder or a pre-disposition to a 8099, 46455, 54414, 53763, 67076, 
67102, 441 81, 67084FL, or 67084alt -associated disease or disorder, said method 
comprising the steps of receiving information related to 8099, 46455, 54414, 53763, 67076, 
30 67102, 44181, 67084FL, or 67084alt (e.g., sequence information and/or information related 
thereto), receiving phenotypic information associated with the subject, acquiring 
information from the network related to 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt and/or related to a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
67084FL, or 67084alt -associated disease or disorder, and based on one or more: of the 
35 phenotypic information, the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 
67084alt information, and the acquired information, determining whether the subject has a 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt -associated disease 
or disorder or a pre-disposition to a 8099, 46455, 54414, 53763, 67076, 67102, 44181, 
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67084FL, or 67084alt -associated disease or disorder. The method may further comprise the 
step of recommending a particular treatment for the disease, disorder or pre-disease 
condition, • ' 1 . r ' 

The invention also includes an array comprising a 8099, 46455, 54414, 53763, 
5 67076, 67 1 02, 44 1 8 1 , 67084FL, or 67084alt sequence of the present invention. The array 
can be used to assay expression of one or more genes in the array. In one embodiment, the 
array can be used to assay . gene expression in a tissije to ascertain tissue specificity of genes 
in the array. In this manner, ufy to about 7600 genes can be simultaneously assayed for 
expression, one of which can be 8099, 46455, 54414, 53763, 67076, 67102, 44181, 

10 67084FL, oir 67084alt This allqws a profile to be developed showing a battery of genes 
specifically expressed in one or more tissues. 

In addition to such qualitative determination, the invention allows the quantitation of 
gene expression. Thus, not only tissue specificity, but also the leyel of expression of a 
battery of genes in the tissue is ascertainable. 1 Thus, genes can be grouped on the basis of 

1 5 their tissue expression per se ,and level of expression in that tissue. This is useful, for . 
example, in ascertaining the relationship of gene expression between or among tissues. 
Thus, one tissue can be perturbed and the effect on gene expression in a second tissue can be 
determined. In this context, the effect of one cell type on another cell type in response to a 
biological stimulus can be determined. Such a determination is useful, for example, to know 

20 the effect of cell-cell interaction at the level of gene expression. If an agent is administered 
therapeutically to treat one cell type but has an undesirable effect on another cell type, the 
invention provides an assay to determine the molecular basis of the undesirable effect and 
thus provides the opportunity to co-administer a counteracting agent or otherwise treat the 
undesired effect. Similarly, even within a single cell typq, undesirable biological effects can 

25 be determined at the molecular level. . Thus, the effects of an agent on expression of other 
than the target gene can be ascertained and counteracted. 

In another embodiment, the array can be used to monitor the time course of 
expression of one or more genes in the array. , This can occur in various biological contexts, 
as disclosed herein, for example development of a 8099, 46455, 54414, 53763, 67076, 

30 67102, 441 81, 67084FL, or 67084a! t -associated disease or disorder, progression of 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt -associated disease or 
disorder, and processes, such a cellular transformation associated with the 8099, 46455, 
54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt -associated disease or disorder. 
The array is also useful for ascertaining the effect of the expression of a gene on the 

35 expression of other genes in the same cell or in different cells {e.g., acertaining the effect of 
8099, 46455, 54414, 53763, 67076, 67102, 44181 , 67084FL, or 67084alt expression on the 
expression of other genes). This provides, for example, for a selection of alternate 
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I . . 

molecular targets for therapeutic intervention if the ultimate or downstream target cannot be 
regulated. 

The array is also useful for ascertaining differential expression patterns of one or 
more jgenes in normal and abnormal cells. This providps a battery of genes (e.g. , including 
5 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt ) that could serve 
as a molecular target for diagnosis or therapeutic intervention. 
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. This invention is further illustrated by the following examples which should iiot be 
construed as limiting. The cohtents of all references, patents and published patent 
application^ cited throughout this application, as well as the Figures and the Sequence 
5 Listing, are incorporated herein by reference. 

1 EXAMPLES 

10 EXAMPLE 1 : IDENTIFICATION AND CHARACTERIZATION OF HUMAN 

8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, AND 
67084alt cDNAs • 

In this example, the identification and characterization of the gene encoding human 
15 8099, 46455, 54414, 53763, 67076, 67102, 44181, full length 67084 (67084FL), and 
67084alt is described. 

i 

• 

Isolation of the human 8099 and 46455 cDNAs 

The invention is based, at least in part, on the discovery of a human gene encoding a 

20 novel polypeptide, referred to herein as human 8099. The entire sequence of the human 
clone 8099 was determined and found to contain an open reading frame termed human 
"8099." The nucleotide sequence of the human 8099 gene is set forth in Figures 1 A-B and 
in the Sequence Listing as SEQ ID NO:L The amino acid sequence of the human 8099 
expression product is set forth in Figure 1 and in the Sequence Listing as SEQ ID NO: 2, 

25 The 8099 polypeptide comprises 617-amino acids. The coding region (open reading frame) 
of SEQ ID NO:l is set forth as SEQ ID NO:3. Clone 8099, comprising the coding region of 
human 8099, was deposited with the American Type Culture Collection (ATCC®), 1 080 1 
University Boulevard, Manassas, V A 201 10-2209, on and assigned Accession 

No. 

30 The invention is further based, at least in part, on the discovery of a human gene 

encoding a novel polypeptide, referred to herein as human 46455. The entire sequence of 
the human clone 46455 was determined and found to contain an open reading frame termed 
human "46455." The nucleotide sequence of the human 46455 gene is set forth in Figure 4 
and in the Sequence Listing as SEQ ID NO :4. The amino acid sequence of the human 

35 46455 expression product is set forth in Figures 8A-B and in the Sequence Listing as SEQ 
ID NO:5. The 46455 polypeptide comprises 528 amino acids. The coding region (open 
reading frame) of SEQ ID NO:4 is set forth as SEQ ID NO:6. Clone 46455, comprising the 
coding region of human 46455, was deposited with the American Type Culture Collection 
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(ATCC®), 1 0801 University Boulevard, Manassas, VA 201 1 0-2209, on ■ and 
assigned Accession No. 

Analysis of the Human 8099 and 46455 Molecules 
5 A search using the polypeptide sequence of SEQ ID NO:2 was performed against the 

HMM database in PFAM (Figures 3 A-B) resulting in the identification of a sugar 
transporter family domain in the amino acid sequence of human; 8099 at about residues 43- 
564 of SEQ ID NO:2 (score = 3 1 8.2), a potential FecCD failiily domain in the amino acid 
sequence of human 8099 at about residues 26-227 of SEQ ID NO:2 (score - -218.2), and a 

10 potential monocarboxylate transporter domain in the amino acid sequence of human 8099 at 
■■ about residues 29-567 of SEQ ID NO:2 (score = -235!8). 1 
The amino acid sequence of human 8099 was analyzed using the program PSORT 
(available through the Prosite website) to predict the localization of the proteins within the 
cell. This program assesses the presence of different targeting and localization amino acid 

15 sequences within the query sequence. The results of this analysis show that human 8099 
may be localized to the endoplasmic reticulum or mitochondria. 

Searches of the amino acid sequence of human 8099 were further performed against 
the Prosite database. These searches resulted in the identification in the amino acid 
sequence of human 8099 of a number of potential N-glycosylation sites at about amino acid 

20 residues 371-374, 383-386, 396-399, 401-404 of SEQ ID NO:2, a number of potential 

protein kinase C phosphorylation sites at about amino acid residues 220-222, 256-258, 403- 
405 of SEQ ID NO:2, a number of potential casein kinase II phosphorylation sites at about 
amino acid residues 18-21, 75-78, 169-172, 246-249, 256-259, 264-267, 385-388, 403-406, 
443-446, 520-523 of SEQ ID NO:2, a number of potential N-myristoylation sites at about 

25 ^mino acid residues 51-56, 59-64, 89-94, 141-146, 165-170, 178-183, 207-212, 228-233, 
395-400, 441-446, and 493-498 of SEQ ID NO:2, a potential amidation site at about amino 
• acid residues 104- 107 of SEQ ID NO:2, a potential leucine zipper motif at about amino acid 
residues 1 12-133 of SEQ ID NO:2, and potential sugar transport protein signature 1 domain 
at about amino acid residues 97-1 14 of SEQ ID NO:2. 

30 A MEMS AT analysis of the polypeptide sequence of SEQ ID NO:2 was also 

performed, predicting thirteen transmembrane domains in the amino acid sequence of 
' human 8099 (SEQ ID NO:2) at about residues 32-49, 58-74, 81-101, 109-130, 138-156, 
165^184, 198-217, 279-301, 315-338; 346-364, 463-487, 499-521, and 529-549. Further 
analysis of the amino acid sequence of SEQ ID NO:2 (e.&, alignment with, for example, 

35 known E r coli sugar symporter proteins and a known human facilitative glucose transporter 
protein) showed that the second transmembrane domain at about amino acid residues 58-74 
of SEQ ID NO:2 is not utilized, resulting in the presence of twelve transmembrane domains 
in the amino acid sequence of human 8099 (SEQ ID NO:2). 
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A search of the amino acid sequence of human 8099 was also performed against the 
ProDom database resulting in the identification of several transmembrane domains, a 
glycosyltrarisferase domain, and a sugar transport domain in 1 the amino acid sequence pf 
SEQ ID NO:2. , , 

5 The human 8099 amino acid sequence was aligned with the amino acid sequence of 

the galactose-protori symporter GALP from E. coli using the CLUSTAL W (1 .74) multiple 
sequence alignment program. The results of the alignment are set forth in Figure 4. The 
human 8099 amino acid sequence was also aligned with the amino acid sequence of the 
arabinose-proton symporter ARAE from K coli using the CLUSTAL W ( 1 .74) multiple 

10 sequence alignment program. ' The results of the alignment are set forth in Figure 5. The 
human 8099 amino acid sequence was also aligned with the amino acid sequence of the * 
facilitative glucose transporter GLUT8 from Homo sapiens using the CLUSTAL W (1 74) 
, .multiple sequence alignment program. The results of the alignment are set forth in Figure 7 
Based on its homology to GLUT8, 8099 is also referred to herein as "GLUT8 homologue" 

15 or "GLUT8h" and can be used interchangeably throughout. . 

A search using the polypeptide sequence of human 46455 (SEQ ID NO :5) was 
performed against the HMM database in PFAM (Figures 1.0A-C) resulting in the 
identification of a sugar transporter family domain in the amino acid sequence of human 
46455 at about residues 58-469 of SEQ ID NO:5 (score = -63.4), a potential 

20 sodium:galactoside symporter family domain in the amino acid sequence of human 46455 at 
about residues 212-505 of SEQ ID NO:5 (score = -121.2), and a potential monocarboxylate 
transporter domain in the amino acid sequence of human 46455 at about residues 60-473 of 
SEQ ID NO:5 (score = -208.2). 

The amino acid sequence of human 46455 was analyzed using the program PSORT 

25 to predict the localization of the proteins within the cell. This program assesses the presence 
of different targeting and localization amino acid sequences within the query sequence. The 
results of this analysis show that human 46455 may be localized to the endoplasmic 
reticulum, mitochondria, nucleus or secretory vesicles. 

Searches of the amino acid sequence of human 46455 were further performed against 

30 the Prosite database. These searches resulted in the identification in the amino acid 

, sequence of human 46455 of a potential N-glycosylation site at about amino acid residues 
199-202 of SEQ ID NO:5, a potential cAMP- and cGMP-dependent protein kinase C 
phosphorylation site at about amino acid residues 414-417 of SEQ ID NO:5, a number of 
potential protein kinase C phosphorylation sites at about amino acid residues 344-346, 413- 

35 415, 442-444, and 518-520 of SEQ. ID NO:5, a number of potential casein kinase II 

phosphorylation sites at about amino acid residues 1 1-14, 943-946, 959-962, 983-986, 1074- 
1077, 1 108-1 1 1 1 , and 1 1 12-1 1 15 of SEQ ID NO:5, a number of potential N-myristoylation 
sites at about amino acid residues 89-94, 106-1 1 1 , 288-293, 679-684, 767-772, 847-852, and 
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933-938 of SEQ ID NO:5, an amidation site at about amino acid residues 223-226 ofSEQ 
ID NO;5, and an ATP/GTP-binding site motif A (P-loop) at about amino acid residues 
1008-1015 of SEQ ID NO:5. 1 • > 

A MEMSAT analysis of the polypeptide sequence of SEQ ID NO:5 was also 

5 performed, predicting eleven transmembrane domains in the amino acid sequence of human 
46455 (SEQ ID NO:5) at about residues 98-1 18, 126-145, 165-181, 188-205, 218-238, 273- 
294, 323-341, 357-377, 386-410, 423-441, and 462-485. Further analysis of the amino acid 
sequence of SEQ ID NO:5 (e.g. 1 , analysisof the hydropathy plot set forth in Figure 9) 
resulted in the identification of a twelfth transmembrane domain at about amino acid 

10 residues 58-74 of SEQ ID NO:5. , 

A search of the amino acid sequence of human 46455 was also performed against the 
ProDom database- resulting in the identification of a transmembrane efflux domain in the 
amino acid sequence of SEQ ID NO:5. , 

The human 46455 amino acid sequence was aligned with the amino acid sequence of 

15 Z92825 from C. elegans using the CLUSTAL W (1 .74) multiple sequence alignment . 
program. The results of the alignment are set forth in Figure 11. 1 

■ * » . 
Isolation of the human 54414 and 53763 cDNA 

The invention is based, at least in part, on the discovery of genes encoding novel 
20 members of the potassium channel family . The entire sequence of human clone Fbh54414 
was determined and found to contain an open reading frame termed human "54414". The 
entire sequence of human clone Fbh53763 was determined and found to contain an open 
reading frame termed human "53763". 

The nucleotide sequence encoding the human 544 \ 4 is shown in Figures 12A-D and 
25 is set forth as SEQ ID NO:7. The protein encoded by this nucleic acid comprises about 

1118 amino acids and has the amino acid sequence shown in Figures 12A-D and set forth as 
SEQ ID NO:8, The coding region (open reading frame) of SEQ ID NO:7 is set forth as SEQ 
ID NO:9. Clone Fbh54414, comprising the coding region of human 54414, was deposited 
with the American Type Culture Collection (ATCC®), 10801 University Boulevard, 

30 Manassas, VA 201 10-2209, on ;, and assigned Accession No. . 

The nucleotide sequence encoding the human 53763 is shown in Figures 1 6A-C and 
is set forth as SEQ ID NO: 10. The protein encoded by this nucleic acid comprises about 
638 amino acids and has the amino acid sequence shown in Figures 16A-C and set forth as 
SEQ ID NO:l 1 . The coding region (open reading frame) of SEQ ID NO: 10 is set forth as 
35 SEQ ID NO: 12. Clone Fbh53763, comprising the coding region of human 53763, was 
deposited with the American Type Culture Collection (ATCC®), 10801 University 
Boulevard, Manassas, V A 20 1 1 0-2209, on and assigned Accession No. . 
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Analysis of the human 54414 and 53763 Molecules 

The amino acid sequences of human 54414 was analyzed using the program PSORT 
to predict the ldcalizatibn of the proteins within the cell.. The results of thfc analyses show 
that human 54414 may be localized to the endoplasmic reticulum, the nucleus, secretory 
5 vesicles, or the mitochondria. 

Analysis of the amino acid sequences of human 54414 was performed using 
MEMS AT. The amino acid sequence of human 54414 was also compared to the amino acid 
sequences of known potassiunl transporters (Figures 15A-B). This analysis resulted 'in the 
identification of six possible transmembrane domains in the amino acid sequence of human 
10 54414 at residues 64-83, 104-127, 135-153, 161-173, 199-217, and 257-274 of SEQ ID 
NO:8 (Figure 13). * 

Searches of the amino acid sequences of human 54414 were performed against, the 
HMM database (Figure 14). These searches resulted in the identification of an "ion 
transport protein domain", at about residues T04-277 of SEQ ID NO:8 (score= 62.4). 

1 5 Searches of the amino acid sequence of human 544 14 were further performed against 

the Prosite™ database. These searches resulted in the identification of several possible N- 
glycosylation sites at about amino acids .residues 66-69, 99-102; 290-293, 545-548, 554-557, 
573-576, 981-984, and 1 106-1 109 of SEQ ID NO:8, several possible cAMP- and cGMP- 
dependent protein kinase phosphorylation sites at about amino acids residues 625-628, 994- 

20 997, 1002-1005, and 1 100-1 103 of SEQ ID NO:8,,several possible protein kinase C 

phosphorylation sites at about amino acid residues 43-45, 59-61, 68-70, 126-128, 158-160, 
254-256, 298-300, 308-310, 354-356, 385-387, 464-466, 605-607, 903-905, 939-941, 947- 
949, 1005-1007, 1012-1014, 1030-1032, and 1099-1101 of SEQ IDNO:8, several possible 
casein kinase II phosphorylation sites at about amino acid residues 43-46, 115-11 8, 338-341 , 

25 386-389, 393-396, 485-488, 556-559, 651-654, 655-658, 663-666, 784-787, 837-840, 867- 
870, 907-910, 926-929, 943-946, 959-962, 983-986, 1Q74- 1077, 1 108-1 11 1, and 1112 -1115 
of SEQ ID NO:8, several possible N-myristoylation sites at about amino acid residues 89- 
94, 106-1 11, 288-293, 679-684, 767-772, 847-852, and 933-938 of SEQ ID NO:8, a possible 
amidation site at about amino acid residues 223-226 of SEQ ID NO:8, and a possible 

30 ATP/GTP-binding site motif A (P-loop) at about amino acid residues 1008-1015 of SEQ ID 
NO:8. 

The amino acid sequence of human 53763 was analyzed using the program PSORT 
to predict the localization of the proteins within the cell. The results of the analyses further 
show that human 53763 may be localized to the endoplasmic reticulum, the mitochondria, or 
35 the nucleus. 

Analysis of the amino acid sequences of human 53763 was performed using 
MEMS AT. The amino acid sequence of human 53763 was also compared to the amino acid 
sequences of known potassium transporters (Figure 19). This analysis resulted in the 
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identification of six possible transmembrane domains in the amino acid sequence of human 
53763 at residues 230-248, 287-303, 314-335, 346-368, 382-402, and 451-473 of SEQ ID • 
NO: 11 (Figure 17). . ■ ' ' " "\ • v. 

Searches of the amino acid sequence of human ,537^3 were, also performied against 

5 the HMM database (Figures 18A-B). These searches resulted in the identification of a 
tc NADH-ubiquinorie/plastoquinone oxidoreductase domain" at about residues 3 17-467 of 
SEQ ID NO: 1 1 (score= -8.1 .7), an "ion transport protein domain" at about residues 28 1-472 
of SEQ ID NO: 1 1 (score = 1 1 6.9), and a a K + channel tetramerisation domain" at about 
residues 8-156 of SEQ ID NO:l 1 (score = 156.7). 

10 Searches of the amino 1 acid sequence of human 53763 were also performed against 

1 fhe Prosite™ database. These searches resulted in the identification in the amino acid 1 
sequence of human 53763 a number of potential N-glycosylation sites at amino acid 
residues 84-84, 259-262, 266-269, 518-521, and 536-539 of SEQ ID NO: l 1, a potential 
cAMP- and cGMP-dependent protein kinase phosphorylation site at amino acid residues 

15 56 1 -564 of SEQ ID NO: 1 1 , protein kinase C phosphorylation, sites at amino acid residues 
21-23,25-27, 86-88, 120-122, 155-157, 205-207, 224-226, 336-338, 374-376, and 564-566 
of SEQ ID NO: 1 1 , casein kinase II phosphorylation sites a,t amino acid residues 1 7-20, 49- 
52, 146-149, 283-286, 378-381, 414-417, 520-523, 541-544, 546-549, 553-556, 564-567, 
and 579-582 of SEQ ID NO: 1 1 , and N-myristoylation sites at amino acid residues 3 1 -36, 

20 76-81, 83-88, 89-94, 142-147, 176-181, 191-196, 199-204, 407-412, 450-455, 477-482, 
590-595, and 606-611 of SEQ ID NO:l 1. 

Searches of the amino acid sequences of human 54414 and human 53763 were also 
performed against the ProDom database. A potassium ionic calcium activated domain and 
two potassium ionic subunits were identified in the amino acid sequence of 544 1 4 (SEQ ID 

25 NO:8). Several transmembrane domains and transport family domains were identified in the 
the amino acid sequence of 53763 (SEQ ID NO:ll). . y 

The amino acid sequences of human 54414 and human 53763 were further analyzed 
for the presence of a "pore domain", also known as a "P-region domain". A pore domain 
was identified in the amino acid sequence of human 54414 at about residues 229-250 of 

30 SEQ ID NO:8. A pore domain was identified in the amino acid sequence of human 53763 
at about residues 426-44 1 of SEQ ID NO: 11. 

The amino acid sequences of human 54414 and human 53763 were also analyzed for 
the presence of a "potassium channel signature sequence motif (see Joiner, W. J. et aL 
(1998) Nat Neurosci. 1 :462-469 and references cited therein). A potassium channel 

35 signature sequence motif was identified in the amino acid sequence of human 54414 at 
about residues 239-246 of SEQ ID NO: 8. A potassium channel signature sequence motif 
was identified in the amino acid sequence of human 53763 at about residues 436-441 of 
SEQ IDNOrll. 
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The amino acid'sequence of human 53763 was also analyzed for the presence of a 
"voltage sensor motif'. A voltage sensor motif was identified in the amino acid sequence of 
human 53763 at about residues 348-363 of SEQ ID NQ:1 1 . 'Positively charged amino acid 
residues in the voltage sensor motif were identified about residues 348, 351, 354, 357, 360, 
5 and 363 of SEQ ID NO:5. 

Isolation of the human 67P76, 67102, 44181, 67084FL, or 67084alt cDNAs 

The invention is based, at least in part, on the discovery of a human gene encoding 
novel polypeptides, referred to herein as human 67076, 67102, 44181, 67084FL, and 

10 67084alt. The entire sequence of the human clone 67076 was determined and found to 
contain an open reading frame termed human "67076." The nucleotide sequence of the ' 
human 67076 gene is set forth in Figures 20A-E and in the Sequence Listing as SEQ ID 
, ,NO:l 3. The amino acid sequence of the human 67076 expression product is set forth in 
Figures 20A-E and in the Sequence Listing as SEQ ID NO: 14. The 67076 polypeptide 

15 comprises 1129 amino acids. The coding region (open reading frame) of SEQ ID NO: 13- is 
set forth as SEQ ID NO: 15. Clone 67076, comprising the coding region of human 67076, 
was deposited with the American Type Culture Collection <ATCC®), 10801 University 

Boulevard, Manassas, VA 201 10-2209, on , and assigned Accession No. 

The entire sequence of the human clone 67102 as determined and found to contain 

20 an open reading frame termed human "67102." The nucleotide sequence of the human gene 
is set forth in Figures 24A-E and in the Sequence Listing as SEQ ID NO: 16. The amino 
acid sequence of the human 67 1 02 expression product is set forth in Figures 24 A-E and in 
the Sequence Listing as SEQ ID NO: 1 7. The 67 1 02 polypeptide comprises 1 426 amino 
acids. The coding region (open reading frame) of SEQ ID NO:16 is set forth as SEQ ID 

25 NO: 18. Clone 67 1 02, comprising the coding region of human 67 1 02, was deposited with 
the American Type Culture Collection (ATCC®), 10801 University Boulevard, Manassas, 

VA 20 1 1 0-2209, on and assigned Accession No. . 

The entire sequence of the human clone 44181 was determined and found to contain 
an open reading frame termed human "441 81 The nucleotide sequence of the human 

30 44181 gene is set forth in Figures 28A-E and in the Sequence Listing as SEQ ID NO: 1 9. 
The amino acid sequence of the human 44181 expression product is set forth in Figure 7 A-E 
and in the Sequence Listing as SEQ ID NO:20. The 441 81 polypeptide comprises 1 177 
amino acids. The coding region (open reading frame) of SEQ ID NO: 19 is set forth as SEQ 
ID NO:21. Clone 44181, comprising the coding region of human 44181, was deposited with 

35 the American Type Culture Collection (ATCC®), 10801 University Boulevard, Manassas, 

VA 201 10-2209, on . and assigned Accession No. 

The entire sequence of the human clone 67084FL was determined and found to 
contain an open reading frame termed human "67084FL." The nucleotide sequence of the 
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human 67084FL gene is set forth in Figures 32A-D and in the Sequence Listing as SEQ ID 
NO:22. The amino add sequence of the human 67084FL expression product is set forth in 
Figures 32A-D and in the Sequence Listing as SEQ ID NO -23. The 67.084FL polypeptide 
comprises 1 084 amino acids. The coding region (open reading frame) of SEQ ID NO:22 is 
5 set forth as SEQ ID NO:24. Clone 67084FL, comprising the coding region of human 
67084FL, was deposited with the American Type Culture Collection (ATCC®), 10801 
University Boulevard, Manassas, VA 201 10-2209, on - ' and assigned Accession 

No. i ' ' 1 \ *' ; 

The entire sequence of the human clone 67084alt was determined and found to 

10 contain an open reading frame termed human "67084alt." The nucleotide sequence of the 
human 67084alt gene is set forth in Figures 36A-D and in the Sequence Listing as SEQ ID 
NO:25 v The amino acid sequence of the human 67084alt expression product is set forjh in 
Figures 36A-D and in the Sequence Listing as SEQ ID NO:26. The 67084alt polypeptide 
comprises 1095 amino acids. The coding region (open reading frame) of SEQ ID NO:25 is 

15 set forth as SEQ ID NO:27. Clone 67084alt, comprising the coding region of human 
67084alt, was deposited with the American Type Culture Collection (ATCC®), 1 080 1 
University Boulevard, Manassas, VA 201 10-2209, on 1 , and assigned Accession 

No._ . 

20 Analysis of the Human 67076, 67102, 44181, 67084FL, or 67084alt Molecules 

The amino acid sequences of human 67076, 67102, 44181, 67084FL, or 67084alt 
were analyzed for the presence of sequence motifs specific for P-type ATPases (as defined 
in Tang, X. et al. (1996) Science 272:1495-1497 and Fagan, M. J. and Saier, M. H. (1994) J. 
Mol EvoL 38:57). 

25 These analyses resulted in the identification of a P-type ATPase sequence I motif in 

the amino acid sequence of human 67076 at residues 173-181 of SEQ ID NO:14, in the 
amino acid sequence of human 67102 at residues 208-216 of SEQ ID NO:17, in the amino 
acid sequence of human 44181 at residues 173-181 of SEQ ID NO:20, in the amino acid 
sequence of human 67084FL at residues 213-221 of SEQ ID NO:23, and in the amino acid 

30 sequence of human 67084alt at residues 213-221 of SEQ ID NO:26. 

These analyses also resulted in the identification of a P-type ATPase sequence 2 
motif in the amino acid sequence of human 67076 at residues 406-415 of SEQ ID NO: 14, in 
the amino acid sequence of human 67102 at residues 435-444 of SEQ ID NO:17, in the 
amino acid sequence of human 44181 at residues 404-413 of SEQ ID NO:20, in the amino 

35 acid sequence of human 67084FL at residues 41 3-422 of SEQ ID NO:23, and in the amino 
acid sequence of human 67084alt at residues 4 1 3-422 of SEQ ID NO:26. 

These analyses further resulted in the identification of a P-type ATPase sequence 3 
motif in the amino acid sequence of human 67076 at residues 813-824 of SEQ ID NO:14, in 
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the amino acid sequence of human 67102 at residues 1054-1064 of SEQ ID NO: 17, in the 
amino acid sequence of human 44181 at residues 8 1 9-82? of SEQ ID NO:20, in the amino ■ 
acid sequence of human 67084FL at fesidues 820-830 of SEQ IP NO;23, and in the amino 
acid sequence of human 67084alt at residues 820-830 of SEQ ID NO:26. 
5 The amino acid sequences of human 67076, 67102, 44181, 67084FL, and 67084alt 

were also analyzed for the presence of phospholipid transporter specific amino acid residues 
(as defined in Tang, X. et al. (1996) Science 272:1495-1497). These analyses also resulted 
in the identification of phospholipid transporter specific amino acid residues in the amino 
acid sequence of human 67076 atresidues 174, 177, 407, 813, 823, and 824 of SEQ ID 

10 NO: 14. These analyses resultbd in the identification of phospholipid transporter specific 

, 1 ?mino acid residues 208, 209, 212, 436, 1054, 1063, and 1064 in the amino acid sequence of 
human 67102 at* residues of SEQ ID NO:17. These analyses further resulted in the 
identification of phospholipid transporter specific amino acid residues 174, 177, 405, 819, 
_ 928, and 929 in the amino acid sequence of human 44 1 8 1 at residues of SEQ ID NO:20. 

1 5 These analyses farther resulted in the identification of phospholipid transporter specific 
amino acid residues 214, 217, 820, 829, and 830 in the amino acid sequence of human 
67084FL at residues of SEQ ID NO:23. These analyses stjll further resulted in the 
identification of phospholipid transporter specific amino acid residues 214, 217, 820, 829, 
and 830 in the amino acid sequence of human 67084alt at residues of SEQ ID NO:26. 

20 The amino acid sequences of human 67076, 67 1 02, 441 81 , 67084FL, and 67084alt 

were also analyzed for the presence of extramembrane domains. An N-terminal large 
extramembrane domain was identified in the amino acid sequence of human 67076 at 
residues 1 05-291 of SEQ ID NO: 14. A C-terminal large extramembrane domain was 
identified in the amino acid sequence of human 67076 at residues 366-872 of SEQ ID 

25 NO: 14. An N-terminal large extramembrane domain was identified in the amino acid 
sequence of human 67102 at residues 141-321 of SEQ ID NO:17. A C-terminal large 
extramembrane domain was identified in the amino acid sequence of human 671 02 at 
residues 391-581 of SEQ ID NO: 17. An N-terminal large extramembrane domain was 
identified in the amino acid sequence of human 44 1 81 at residues 1 05-289 of SEQ ID 

30 NO:20. A C-terminal large extramembrane domain was identified in the amino acid 
sequence of human 44181 at residues 364-877 of SEQ ID NO:20. An N-terminal large 
extramembrane domain was identified in the amino acid sequence of human 67084FL at 
residues 1 45-33 0 of SEQ ID NO :23 . A C-terminal large extramembrane domain was 
identified in the amino acid sequence of human 67084FL at residues 380-886 of SEQ ID 

35 NO:23 . An N-terminal large extramembrane domain was identified in the amino acid 
sequence of human 67084alt at residues 145-330 of SEQ ID NO:26. A C-terminal large 
extramembrane domain was identified in the amino acid sequence of human 67084alt at 
residues 380-886 of SEQ ID NO:26. 
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The amino acitf sequence of human 67076 was analyzed using the program PSORT 
to predict the localization of the proteins within the cell. This program assesses the presence 
of different .targeting and localization amiri6 acid sequences within the query sequence. The 
5 results of this analysis predict that human 67076 may be localized to the endoplasmic 
reticulum. 

Searches of the amino acid sequence of human 67076 were further performed against 
the Prosite database. These searches resulted in the identification in the amino acid ' ; 
sequence of human 67076 of a number of potential N-glycosylation sites at amino acid 

10 residuesl21'-124, 392-395, 761-764, 992-995, and 1098-1101 of SEQ ID NO:14, a number 
of potential cAMP-and cGMP-dependent protein kinase phosphorylation sites at amino acid 
residuesl'35-13&, 545-548, 1091-1094, and 1 102-1 105 of SEQ ID NO:14, a number of 
potential protein kinase C phosphorylation sites at amino acid residues 47-49, 138-140, 204- 
206, 250-252, 254-256, 278-280,308-310, 328-330, 334-336, 408-410, 680-682, 701-703, 

15 708-710, 733-735, 736-738, 10Q8-1010, 1094-1096, 1 100-1 102, 1 109-1 1 1 1, and 1 1 13-1 115 
of SEQ ID NO:14, a number of casein kinase II phosphoiylatidn sites at amino acid residues 
30-33, 264-267, 282-285, 328-331, 413-416, 442-445, 449-452, 494-497, 646-649, 693-696, 
704-707, 762-765, 813-816, 924-927, 982-985, and 1 121-1 124 of SEQ ID NO:14, a number 
of potential tyrosine kinase phosphorylation sites at amino acid residues 252-258, 739-747 

20 of SEQ ID NO:14, a number of N-myristoylation sites at amino acid residues 388-393, 440- 
445, 482-487, 514-519, 564-569, 753-758, and 807-812 of SEQ ID NO: 14, an ATP/GTP- 
binding site motif (P-loop) at amino acid residues 271-278 of SEQ ID NO:14, and an E1-E2 
ATPases phosphorylation site at amino acid residues 409-4 1 5 of SEQ ID NO: 1 4. 

A MEMSAT analysis of the polypeptide sequence of SEQ ID NO: 1 4 was also 

25 performed, predicting three potential transmembrane domains in the amino acid sequence of 
human 67076 (SEQ ID NO: 14). However, a structural^ hydrophobicity, and antigenicity 
analysis (Figure 21) resulted in the identification of ten transmembrane domains. 
Accordingly, the 67076 protein of SEQ ID NO: 14 is predicted to have at least ten 
transmembrane domains, identified as transmembrane (TM) domains 1 through 10, at about 

30 residues 57-77, 84-105, 292-313, 345-365, 863-883, 905-926, 956-977, 989-1009, 1021- 
104U and 1060-1087. 

A search using the polypeptide sequence of SEQ ID NO: 14 was performed against 
the HMM database in PFAM resulting in the identification of a potential hydrolase domain 
in the amino acid sequence of human 67076 at about residues 403-837 of SEQ ID NO: 14 

35 (score = 12.7). 

A search of the amino acid sequence of human 67076 was also performed against the 
ProDom database resulting in the identification of several hydrolase domains and 
phosphorylation domains in the amino acid sequence of 67076 (SEQ ID NO: 14). 
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The amino acid sequence of human 671 02 was analyzed using the program PSORT. 
The results of this analysis predict that human 67102 I may be localized to the eridoplasmic 
reticulum. ■ 

5 Searches of the amino acid sequence of human 67102 were further performed against • 

the Prosite database. These searches resulted in the identification in the amino acid 
sequence of human 67102 of a number of potential t N-glycosylation sites at amino acid 
residues 29-32, 303-306, 1363-1368, and 1397-1400 of SEQ IDNO':17, a ' I. 

glycdsaminoglycari attachment site at amino acid residues 526-529 of SEQ ID NO: 17, a 

10 number of potential cAMP^ and pGMP-dependent protein kinase phosphorylation sites at 
amino acid residues 38-41 , 451-545, 635-638, and 777-780 of SEQ ID NO:17, a number of 
protein kinase C phosphorylation sites at amino acid residues 47-49, 78-80, 161-163, 240- 
242, 262-264, 280-282, 437-439, 500-502, 563-565, 633-635, 64,4-646, 695-697, 743-745, 
774-776, 827-829, 1000-1002, 1360-1362, arid 1371-1373 of SEQ ID NO: 17, a number of 

15 potential casein kinase II phpsphbrylation sites at amino acid residues 20-23, 161-1 64, 176- 
179, 184-187, ,199-202, 210-213, 232-235, 241-244, 262-265, 312-315, 34^-348, 405-408, 
442-445, 471-474, 477-480, 543-546, 621-624, 644-647, 670-673, 693-696, 727-730, 850- 
853, 866-869, 892-895, 977-980, 1074-1077, 1141-1144, 1199-1202, 1221-1224, 1339- 
1342, 1399-1402, and 1403-1406 of SEQ ID NO:17, two tyrosine kinase phosphorylation 

20 sites at amino acid residues 21-28 and 847-854 of SEQ ID NO: 1 7, a number of potential N- 
myristoylation sites at amino acid residues 69-74, 341-346, 488-493, 510-515, 519-524, 
525-530, 651-656, 703-708, 714-719, 901-906, 955-960, 992-997, 1070-1075, 1 139-1144, 
1229-1234, and 1261-1266 of SEQ ID NO:17, two potential amidation sites at amino acid 
residues 36-39 and 1371-1374 of SEQ ID NO:17, two AJP/GTP-binding site motif A (P- 

25 loop) at amino acid residues 996-1003 and 1364-1371, an E1-E2 ATPases phosphorylation 
site at amino acid residues 438-444 of SEQ ID NO: 1 7, and a prokaryotic membrane 
lipoprotein lipid attachment site at amino acid residues 26-36 of SEQ ID NO: 17. 

A MEMS AT analysis of the polypeptide sequence of SEQ ID NO: 1 7 was also 
performed, predicting ten potential transmembrane domains in the amino acid sequence of 

30 human 67102 (SEQ ID NO:17) at about residues 98-1 15, 122-140, 322-344, 366-390, 582- 
601, 752-770, 1145-1166, 1225-1246, 1253-1276, and 1298-1317. 

A search using the polypeptide sequence of SEQ ID NO: 1 7 was performed against 
the HMM database in PFAM resulting in the identification of a potential hydrolase domain 
in the amino acid sequence of human 67102 at about residues 432-1 077 of SEQ ID NO: 17 

35 (score =1 .5), and the identification of a potential DUF6 domain in the amino acid sequence 
of human 67102 at about residues 1 127-1271 of SEQ ID NO:l 7 (score = -24.6). 



138 



BNSDOCID: <WO__02055701 A2_l_> 



WO 02/055701 PCT/US01/49060 
i ... 

A search of the amino acid sequence of human 67102 was also performed against the 
ProDoni database resulting in the identification of several, hydrolase domains and 
phosphorylation domains in the amino acid sequefcfce of 6671 02, (SEQ ID NO: 1 7). 

5 The amino acid sequence of human 441 81 wa£ analyzed using the program PSORT. 

The results of this analysis predict that human 44 1 8 1 may be localized to the endoplasmic 
reticulum. . ; 

Searches of the amino acid sequence of htiman 44181 were further performed against 
the Prosite database. These searches resulted in the identification in the amino acid 

10 sequence of human 44 1 81 of 4 number of potential Mrglycosylation sites at amino acid 
• residues 331-334, 390-393, 449-452, 461-464, 477-480, 786-789, and 998-1001 of SEQ if) 
NO:20, a number of potential cAMP- and cGMP-dependent protein kinase phosphorylation 
sites at amino acid residues 577-580, 633-636, and 750-753 of SEQ ID NO:20, a number of 
protein kinase C phosphorylation sites at amino acid residues 46-48, 163-165, 276-278, 332- 

15 334, 406-408, 470-472, 574-576, 636-638, 957-959, 1014-1016, and 1 102-1 104 of SEQ ID 
NO:20, a number of potential casein kinase C phosphorylation sites at amino acid residues 
115-118, 262-265, 280-283, 41 1-414, 473-476, 520-523, 527-530, 636-639, 678-681, 737- 
740, 906-909, 929-932, 1100-1103, 1154-1 157, and 1170-1173 of SEQ IDNO:20,a 
potential tyrosine kinase phosphorylation site at amino acid residues 316-322 of SEQ ID 

20 NO:20, a number of potential N-myristoylation sites at amino acid residues 131-136,596- 
601, 766-771, and 993-998 of SEQ ID NO:20, and an E1-E2 ATPases phosphorylation site 
at amino acid residues 407-4 1 3 of SEQ ID NO:20. 

A MEMSAT analysis of the polypeptide sequence of SEQ ID NO:20 was also 

performed, predicting three potential transmembrane domains in the amino acid sequence of 

i 

25 human 44 1 8 1 (SEQ ID NO:20). However, a structural, hydrophobicity, and antigenicity 
analysis (Figure 29) resulted in the identification of ten transmembrane domains. 
Accordingly, the 44 1 8 1 protein (SEQ ID NO :20) is predicted to have at least ten 
transmembrane domains, which are identified as transmembrane (TM) domains 1 through 
10, at about residues 56-72, 87-103, 290-31 1, 343-363, 878-898, 91 1-931, 961-982, 995- 

30 1015, 1027-1047, and 1062-1086. 

A search using the polypeptide sequence of SEQ ID NO:20 was performed against 
the HMM database in PFAM resulting in the identification of a potential E1-E2 ATPase 
domain in the amino acid sequence of human 44181 at about residues 126-164 of SEQ ID 
NO:20 (score =8.6), the identification of a potential DUF1 32 domain in the amino acid 

35 sequence of human 44181 at about residues 579-719 of SEQ ID NO:20 (score » -72.9), and 
the identification of a potential hydrolase domain in the amino acid sequence of human 
44181 at about residues 401-842 of SEQ ID NO:20 (score = 42.8). 
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A search of the amino acid sequence of human 44 1 8 1 was also performed against the 
ProDom database A search of the amino, acid sequence of human 44181 was also performed 
against the ProDom database resulting in the identification of several hydrolase domains and 
phosphorylation domains in the amino acid sequence of 441 81 (SEQ ID NG:20). 
5 A Clustal W (1.74) alignment of the amino acid sequence of human 44181 (SEQ ID 

NO:20) and human potential phospholipid-transporting ATPase IR (ATIR; GenBank 
Accession No.:Q9Y2G3) revealed some sequence homology between 44181 and Accession 
No.:Q9Y2G3. < 

10 The amino acid sequence of human 67084FL was analyzed using the program 

PSORT. The results of this analysis predict that human 67084FL may be localized to the ' 
endoplasmic reticulum. 
, , Searches of the amino acid sequence of human 67084FL were further performed 

against the Prosite database. These searches resulted in the identification in the amino acid 

15 sequence of human 67084FL of a number of potential N-glycosylation sites at amino acid 
residues 3 10-3 13, 464-467, and 529-532 of SEQ ID NO:23, a potential cAMP- and cGMP- 
dependent protein kinase phosphorylation site at amino acid residues 1071-1074 of SEQ. ID 
NO:23, a number of protein kinase C phosphorylation sites 82-84, 168-170, 204-206, 301- 
303, 371-373, 415-417, 486-488, 585-587, 666-668, 744-746, 800-802, 813-815, 872-874, 

20 957-959, and 1009-1011 of SEQ ID NO:23, a number of potential casein kinase II 

phosphorylation sites at amio acid residues 265-268, 301-304, 402-405, 422-425, 535-538, 
596-599, 661-664, 686-689, and 745-748 of SEQ ID NO:23, a tyrosine kinase 
phosphorylation site at amino acid residues 813-816 of SEQ ID NO:23, a number of 
potential N-myristoylation sites at amino acid residues 292-297, 462-467, 568-573, 606- 

25 611, 824-829, 887-892, and 1026-1031 of SEQ ID NO:23, a potential amidation site at 
amino acid residues 813-816 of SEQ ID NO:23, a prokaryotic membrane lipoprotein lipid 
attachment site at amino acid residues 105-1 15, a leucine zipper pattern at amino acid 
residues 325-346, and an E1-E2 ATPases phosphorylation site at amino acid residues 416- 
422 of SEQ ID NO:23. 

30 A MEMSAT analysis of the polypeptide sequence of SEQ ID NO:23 was also 

performed, predicting nine potential transmembrane domains in the amino acid sequence of 
human 67084FL (SEQ ID NO:23). However, a structural, hydrophobicity, and antigenicity 
analysis (Figure 33) resulted in the identification often transmembrane domains. 

Accordingly, the 67084FL protein of SEQ ID NO:23 is predicted to have at least ten 
35 transmembrane domains, which are identified as transmembrane (TM) domains 1 through 
10, at about residues 104-120, 124-144, 331-350, 357-374, 887-903, 912-931, 961-983, 990- 
1008, 1015-1035, and 1043-1067. 
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A search using the polypeptide sequence of SEQ ID NO:23 was performed against 
the HMM database in'^FAM resulting in the identification of two potential E1-E2 ATPase 
in the aminoacid sequence of human 67084FL at about residues 171-199 r of SEQIDNO:23 
(score = 3.0) and 277-305 of SEQ ID NO:23 (score = 13 t 0), and a hydrolase domain at about 
5 residues 410-843 of of SEQ ID NO:23 (score = 19.2). 

A search of the amino acid sequence of human 67G84FL was also performed against 
the ProDom database resulting in the identification ,of several hydrolase domains, 
phosphorylation domains, and 'ATPase domains in the amino acid sequence of 67084FLJ 
(SEQIDNO:23): 

10 A Ciustal W (1-74) alignment of the amino acid sequence of human 67084FL (SEQ 

ID NO:23) and human membrane transport protein (MTRP-1; GenBank Accession 
No. ; Y7 1 656, International Publicaiton No: WO 2000/26245-A2) revealed some sequence 
homology between 67084FL and Accession No.: Y71056. . • 

~~ 15 The amino acid seque^cfe of human 67084alt was analyzed.using the program . 

PSORX The results of this analysis predict that human 67084alt may be localized to the 
endoplasmic reticulum. , - 

Searches of the amino acid sequence of human 67084alt were further performed 
against the Prosite database. These searches resulted in the identification in the amino acid 
20 sequence of human 67084alt of a number of potential N-glycosylation site^s at amino acid 
residues 310-313, 464-467, and 529-532 of SEQ ID NO:26, a potential cAMP- and cGMP- 
dependent protein kinase phosphorylation site at amino acid residues 1083-1086, a number 
of protein kinase C phosphorylation sites at amino acid residues 82-84, 168-170, 204-2-6, 
301-303, 371-373, 415-417, 486-488, 585-587, 666-668,744-746, 800-802, 813-815, 872- 
25 874, 957-959, and 1009-101 1 of SEQ ID NO:26, a number of potential casein kinase II 

phosphorylation sites at amino acid residues 265-268, 301-304, 402-405, 422-445, 535-538, 
596-599, 661-664, 686-689, and 745-748 of SEQ ID NO:26, a tyrosine kinase 
phosphorylation site at amino acid residues 641-648, a number of potential N- . 
myristoylation sites at amino acid residues 292-297, 462-467, 568-573, 606-61 1, 824-829, 
30 887-892, and 1026-1031 of SEQ ID NO:26, a potential amidation site at amino acid residues 
813-816 of SEQ ID NO:26, a potential prokaryotic membrane lipoprotein lipid attachment 
site at amino acid residues 1 05-1 1 5 of SEQ ID NO:26, a leucine zipper pattern at amino acid 
residues 325-346 of SEQ ID NO:26, and an El -E2 ATPases phosphorylation site at amino 
acid residues 416-422 of SEQ ID NO:26. 
35 A MEMSAT analysis of the polypeptide sequence of SEQ ID NO:26 was also 

performed, predicting nine potential transmembrane domains in the amino acid sequence of 
human 67084alt (SEQ ID NO:26). However, a structural, hydrophobicity, and antigenicity 
analysis (Figure 37) resulted in the identification of ten transmembrane domains. 

141 

BNSDOCID: <WO_02055701A2_l_> 



WO 02/055701 PQT/US01/49060 

Accordingly, the 67084alt protean of SEQ ID NQ:26 is predicted ( to have ^t least ten! 
transmembrane domains, which are identified as transmembrane (TM) domains 1 through 
10, at about residues 104-120, 1 124-144, 33 J-350, 357-374, 887-903, 91^-931, 961-983, 
990-1008, 1015-1035, and 1043-1067. 

A search using the polypeptide sequence of SEQ ID NO:26 was performed against 
the HMM database in PFAM resulting in the identificatiqn of two potential E1-E2 ATPase 
in the amino acid sequence of human 67084alt at about residues 42-70 of SEQ ID NQ:26 
(score = 3.0) and 105-133 of S^Q ID NO:26 (score = 13.0), and a potential hydrolase 1 
domain at about amino acid residues 410-843 of SEQ ID NO:26 (score = 19.2). 

A search of the amino acid sequence of human 67084alt was also performed against 
the ProDom database resulting in the identification of Several hydrolase domains, 
phosphorylation domains, and ATPasd domains in the amino acid sequence of 67084alt 
(SEQ ID NO:26). i 

A Clustal W (1.74) alignment of the amino acid sequence of human 67084alt (SEQ ID 
NO: 1 4) and human membrane transport protein (MTRP- 1 ; Ger^Bank Accession 
No. :Y7 1056, International Publication No. WO 2000/2 6245-A2) revealed some sequence 
homology between 67084alt and Accession No. : Y7 1056. 

EXAMPLE 2: TISSUE EXPRESSION OF HUMAN 8099, 46455, 54414, 53763, 

67076, 67102, 44181, full length 67084 (67084FL), and 67084alt mRNA 

Tissue Distribution of Human 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
and 67084alt mRNA Using Taqman™ Analysis 

This example describes the tissue distribution of hurpan 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt mRNA in a variety of cells and tissues, as 
determined using the TaqMan™ procedure. The Taqman™ procedure is a quantitative, 
reverse transcription PCR-based approach for detecting mRNA. The RT-PCR reaction 
exploits the 5' nuclease activity of AmpliTaq Gold™ DNA Polymerase to cleave a 
TaqMan™ probe during PCR. Briefly, cDNA was generated from the samples of interest, 
e.g., lung, ovary, colon, and breast normal and tumor samples, and used as the starting 
material for PCR amplification. In addition to the 5' and 3 ' gene-specific primers, a gene- 
specific oligonucleotide probe (complementary to the region being amplified) was included 
in the reaction (/. e. , the Taqman™ probe). The TaqMan™ probe includes the 
oligonucleotide with a fluorescent reporter dye covalently linked to the 5' end of the probe 
(such as FAM (6-carboxyfluorescein), TET (6-carboxy-4,7,2',7'-tetrachlorofluorescein), 
JOE (6-carboxy-4,5-dichloro-2,7-dimethoxyfluorescein), or VIC) and a quencher dye 
(TAMRA (6-carboxy-N,N,N',N 5 -tetramethylrhodamine) at the 3' end of the probe. 

During the PCR reaction, cleavage of the probe separates the reporter dye and the 
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quencher dye, resulting in increased fluorescence of the reporter. Accumulation of PCR 
products is detected directly by monitoring the increase in fluorescence of the reporter dye. ■ 
When the probe is intact, the proximity of the reporter dye to the quencher dye results in 
suppression of the reporter fluorescence. During PCR,, if the target of interest is present, the 

5 probe specifically anneals between the forward and reverse primer sites. The 5* -3' 

nucleolytic activity of the AmpliTaq™ Gold DNA Polymerase cleaves the probe between 
the reporter and the quencher only if the probe' hybridizes to the target. The probe fragments 
are then displaced from the target, and polymerization of the 1 strand continues. The 3' end of 
the probe is blocked to prevent extension of the probe during PCR. This process occurs in 

10 every cycle and does not interfere with the exponential accumulation of product. RNA was 
• prepared using the trizol method, and treated with DNase to remove contaminating genomic 
DNA. cDNA was synthesized using standard techniques. Mock cDNA synthesis in the 
absence of reverse transcriptase resulted in samples with no detectable PCR amplification of 
the control gene confirms efficient removal of genomie DNA contamination. 

15 . - 

Tissue Distribution of Human 8099 

A human tissue panel was tested revealing highest .expression of human 8099 
mRNA in congestive heart failure (CHF) heart, normal prostate, and brain (see Table 1, 
below). 



TABLE 1. 



Tissue Type 


Mean 


&2 
Mean 


aact 


Expression 


Artery normal 


30.83 


22.31 


8.52 


2.7241 


Aorta diseased 


32.77 


22.32 


10.45 


0.7149 


Vein normal 


29.41 


20.23 


9.18 


1.724 


Coronary SMC 


31.2 


20.91 


10.3 


0.7932 


HUVEC 


32.16 


21.38 


10.78 


0.5687 


Hemangioma 


32.86 


19.66 


13.21 


0.1059 


Heart normal 


28.05 


20.43 


7.62 


5.0834 


Heart CHF 


26.98 


20.68 


6.3 


12.6914 


Kidney " 


27.76 


20.46 


7.3 


6.3238 


Skeletal Muscle 


29.7 


22.17 


7.53 


5.4294 


Adipose normal 


34.16 


20.59 


13.56 


0.0828 


Pancreas 


33.23 


22.29 


10.94 


0.5108 


primary osteoblasts 


32 


20.61 


11.39 


0.3726 


Osteoclasts (diff) 


30.9 


17.55 


13.35 


0.0958 


Skin normal 

1 : : 


34.12 


22.45 


11,68 


0.3058 


Spinal cord normal 


31.93 


21.07 


10.87 


0.5362 


Brain Cortex normal 


28.4 


22.34 


6.06 


14.9885 


Brain Hypothalamus normal 


29.68 


22.35 


7.34 


6.1936 


Nerve 


32.96 


22.25 


10.72 


0.5949 


DRG (Dorsal Root Ganglion) 


30.81 


22.15 


8.65 


2.4808 


Breast normal 


31.91 


21.14 


10.77 


0.5747 


Breast tumor 


32.73 


20.93 


11.81 


0.2785 


Ovary normal 


30.41 


19.82 


10.6 


0.6465 
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Ovary Tumor 


28.36 


19.06 


9.31 


1.5755 


Prostate Normal , , 


"27.29 


19.77 


7.52. 


5.4482 ' 



Tissue Distribution of Human 46455 ' 

A human vessel and tissue panel was tested revealing highest expression of human 
46455 riiRNA in human umbilical vein endothelial cells (HUVEC), erythroid pells, normal 
artery, megakaryocytes, kidney, and CHF heart. 46455 was expressed at higher levels in 
lung tumor, breast tumor, and Colon tumor versus normal lung, breast and colon tissues,' 
indicating a, possible role for 46455 in cellular proliferation disorders (see Table 2, below). 



10 TABLE 2. 



Tissue Type 


Mean 


fi 2 Mean 


3d Ct 


Expression 


Artery normal. 


28.05 


24.09 • 


2.67 


157.6722 


Aorta diseased 


28.75 


23.66 


3.79 


72.d429 


Vein normal 


27.75. 


21.72 


4.75 


37.1627 


Coronary SMC 


2812 


25.12 


1.78 


290.176 


HUVEC 


24.18 


22.59 


0.29 • 


817.9021 


Hemangioma 1 


25.15 


20.98 


2.88 


135.8419 


Heart normal 


26.44 


21.82 


3.33 


99.4421 


Heart CHF 


25.54 


21.09 


3.15 


112.6563 


Kidney 


25.98 


21.49 


3.2 


108.8188 


Skeletal Muscle 


28.22 


24.14 


2.79 


144.586 


Adipose normal 


28.38 


22.21 


4.88 


33.9605 


Pancreas 


27.91 


23.22 


3.4 . 


94.4045 


primary osteoblasts 


27.11 


21.85 


3.97 


63.8133 


Osteoclasts (diff) 


23.64 


18.8 


3.55 


85.3775 


Skin normal 


28.43 


23.27 


3.88 


68.1567 


Spinal cord normal 


26.88 


22.12 


3.47 


90.2456 


Brain Cortex normal 


26.42 


23.4' 


1.73 ; 


301.452 


Brain Hypothalamus normal 


28.1 


23.55 


3.26 


104.386 


Nerve 


28.59 1 


23.88 


3.43 


92.7827 


DRG (Dorsal Root Ganglion) 


28.33 


23.76 


3.28 


102.9489 


Breast normal 


27.31 


22.32 


3.7 


76.9465 


Breast tumor 


26.47 


22.11 


3.07 


119.0797 


Ovary normal 


26.59 


22.16 


3.13 


113.8337 


Ovary Tumor 


28.47 


21.84 


5.33 


24.8605 


Prostate Normal 


27.09 


21.68 


4.12 


57.5117 


Prostate Tumor 


26.51 


21.58 


3.64 


80.2141 


Salivary glands 


27.16 


20.81 


5.07 


29.8733 


Colon normal 


26.3 


20 


5 


31.1419 


Colon Tumor 


25.09 


20.52 


3.29 


102.5927 


Lung normal 


26.02 


19.75 


4.98 


31.6862 


Lung tumor 


25.09 


21.31 


2.48 


178.6243 


Lung COPD 


25.26 


19.71 


4.26 


52.193 


Colon IBD 


26.3 


18.91 


6.1 


14.5786 


Liver normal 


27.66 


21.8 


4.57 


42.101 


Liver fibrosis 


29.31 


24.09 


3.92 


65.8351 


Spleen normal 


27.41 


21.41 


4.71 


38.2075 


Tonsil normal 


25.23 


1932 


4.63 


40.5262 


Lymph node normal 


26.15 


20.35 


4.51 


43.8889 
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Small intestine normal 


28.23 


21.73 


5.22 


26.8302 


Skin-Decubitus 


27.18 


22.82 


' 3.06 


119.908 


Synovium 


28 


21.12 


5.59 • 


20.6889 


BM-MNC 


26.13 ' 


19.32 " 


'5.51 


21.9445 


Activated PBMC 


25.2 


17.95 


5.96 


16.12 


Neutrophils 


24.45 


19.5 


3.65 


79.66 


Megakaryocytes 


22.5 


18.95 


2.26 


208.772 


Erythroid 


24.2 


21.69 


1.23 


427.7975 



1 * 
5 i Tissue Distribution of Human 53763 • 

T ; : — n — 

A human vessel and tissue panel was tested revealing highest expression of human 
53763 mRNA in normal brain cortex, normal hypothalamus, prostate tumor, normal 
prostate, dorsal root ganglion, and normal breast tissue (see Table 3, below). 



TABLE 3. 



Tissue Tvoe 


Mean 


TL *> Moan 


as ct 

vv wl 


1 

Pvnrocc ion 


Artery normal 


40 


22.41 


16.07 


o 


Aorta diseased 


40 


22,05 


16.44 


0 


Vein normal 


40 


19.75 


18.74 


o 


Coronary SMC 


35.16 


21.86 


11.79 


0 


HUVEC 


40 


20.41 


18.08 


o 


Hemangioma 


40 


18.52 


19.96 


o 


Heart normal 


39.43 


19.55 


18.37 


o 


Heart CHF 


40 


18.98 


19.5 


0 


Kidney 


39.44 


19.76 


18.16 


0 


Skeletal Muscle 


38.97 


21.57 


15.89 


0 


Adipose normal 


40 


20.09 


18.4 


0 


Pancreas 


38.91 


20.84 


16.56 


0 


primary osteoblasts 


40 


19.87 


18.61 


0 


Osteoclasts (diff) 


40 


17.09 


21.4 


0 . 


Skin normal 


39.59 


21.22 


16.86 


0 


Spinal cord normal 


31.72 


20.14 


10.07 


0.9303 


Brain Cortex normal 


23.07 


21.56 


0.01 


996.5403 


Brain Hypothalamus normal 


26.15 


20.98 


3.65 


79.3844 


Nerve 


39.08 


•21.23 


16.33 


0 


DRG (Dorsal Root Gang/ion) 


31.66 


21.3 


8.86 


2.1596 


Breast normal 


27.25 


20.41 


5.33 


24.9468 


Breast tumor 


40 


20.02 


18.46 


0 


Ovary normal 


40 


19.66 


18.83 


0 


Ovary Tumor 


40 


19.7 


18.79 


0 


Prostate Normal 


29.68 


19.32 


8.85 


2.1671 


Prostate Tumor 


28.14 


19.95 


6.67 


9.8204 


Salivary glands 


40 


18.97 


19.52 


0 


Colon normal 


39.09 


17.8 


19.78 


0 


Colon Tumor 


40 


18.63 


19.86 


0 


Lung normal 


40 


17.49 


21 


0 


Lung tumor 


39.66 


19.81 


18.34 


0 


Lung COPD 


40 


17.97 


20.52 


0 


Colon IBD 


40 


17.3 


21.18 


0 



145 

BNSDOCID: <WO__Q2055701A2_l_> 



WO 02/055701 



PCT/US01/49060 



Liver normal 


40 


1Q *V7 

. I xJ.Ol 


1R Q1 


u 


Liver fibrosis 


40 

*tw 


91 <*4 


' 17 1*; 
I # . I o 


n 
u 


Snleen normal 


40 


1Q 97 


1 Q 99 • 


u 


Tonsil normal* 




1fi 7^ 
ID./v) 


' 1 ^ 94 


u.i/zoo. 


Lymph •node normal 


38.61 


1ft 4 


1ft 7 
, JO,/ 


u 


Small intestine normal 


OU.wU 


1Q Qfi 


1^ ftft 


u 


Skin-Decubitus 


A1 


90 41 


17 ^1 


n 
u 


Synovium 

w vi ivviuiii 


40 


1Q **9 


1Q 1fi 

iy. id 


n 
u 


BM-MNG 


40 


, 18.21 


20.27 


0 


Activated PBMC 


38.88 


17.5 


19.88 


0 


Neutrophils 


40 


18.38 • 


20,11 


0 


Megakaryocytes 


40 


18.09 


20.39 


0 


Erythroid 


40 


21.23 


17.25 


0 



T 



Tissue Distribution of Human 67076 

A human vessel panel was tested revealing highest expression of human 67076 
, ,mRNA in normal aorta, diseased artery, and static HUVEC (see Table 4, below). 



TABLE 4. 





Moo in 


is z nnean 


do \Jt 


Expression 


Aortic SMC 


95 5ft 


91 1fi 


A AO 




Coronary SMC 


29 11 


94 *\F> 


A 7fi 


oc one 


Huvec Static 


23 55 


90 


9 QR 




Huvec LSS 


23.41 


90 06 




Qft 07^ 


H/Adipose/MET 8 


27.7 


20 51 


7 1ft 


R R701 

\J.Ol £.0 


H/Artery/Normal/Carotid/CLN 595 


26.82 


19.34 


7.48 


5.6014 


H/Artery/Normal/Carotid/CLN 598 


28.79 


20.41 


8.37 


3.0226 


H/Artery/normal/NDR 352 


29.41 


21.68 


7.73 


4.7102 


H/IM Artery/Normal/AMC 73 


32.65 


23.77 


8.88 


2.1152 


H/Muscular Artery/Normal/AMC 236 


29.2 


23.34 


5.87 


17.1577 


H/Muscular Artery/Normal/AMC 254/ 


29.68 


22.56 


7.13 


7.1393 


4/Muscular Artery/Normal/AMC 259 


29.63 


22.25 


7.37 


6.0452 


H/Muscular Artery/Normal/AMC 261 


30.12 


22.67 


7.45 


5.7191 


H/Muscular Artery/Normal/AMC 275 


30.2 


24.2 


6 


15.6792 


H/Aorta/Diseased/PIT 732 


30.73 


22.36 


8.38 


3.0121 


H/Aorta/Diseased/PIT 710 


29.6 


23.14 


6.46 


11.3199 


H/Aorta/Diseased/PIT 71 1 


29.35 


22.63 


6.72 


9.4531 


H/Aorta/Diseased/PIT 712 


28.77 


22.02 


6.75 


9.2585 


H/Artery/Diseased/iliac/NDR 753 


26.11 


19.41 


6.71 • 


9.585 


H/Artery/Diseased/Tibial/PIT 679 


29.82 


20.34 


9.47 


1.4101 


H/Vein/Normal/SaphenousAMC 107 


31.66 


21.07 


10.59 


0.6488 


H/Vein/Ndrmal/NDR 239 


33.13 


21.65 


11.49 


0.3477 


H/Vein/Normal/Saphenous/NDR 237 


29.71 


20.59 


9.12 


1.7972 


H/Vein/Normal/PIT 1010 


28.34 


22.05 


6.3 


12.6914 


HA/ein/Normal/AMC 191 


28.64 


22.15 


6.49 


11.164 


H/Vein/Normal/AMC 130 


27.41 


21.27 


6.14 


14.1309 


H/Vein/Normat/AMC 188 


30.56 


24.09 


6.46 


11.3199 


H/Vein/Normal/AMC 196 


29.89 


20.93 


8.96 


2.008 


H/Vein/Normal/AMC211 


32.55 


23.52 


9.03 


1.9196 


H/Vein/Normal/AMC 214 


30.93 


22.99 


7.95 


4.058 


M/Artery/Diseased/CAR 1 174 


24.56 


23.05 


1.5 


352.3302 


M/Artery/Diseased/CAR 1 1 75 


24.98 


19.89 


5.09 


29.2585 
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M/Aorta/Norfnal/PRI 286 . 


25.52 


18.68 


6.84 


8.7288' 


M/Artery/Normal/PRI 324 , ► " 


25.1.3 


20.65 ' 


4.48 ' 


44.8111 


M/Aorta/Normal/PRI 264 


24.14 


24.74 


-0.6 


1515.7166 


M/Artery/Normal/PRI 320 ' ' 


24.93 


20.29 


4.64 


40.1071 


MA/ein/Normal/PRI 328 


26.67 


20.04 


6.63 


.10.0965 


HUVEC Vehicle 


26.64 


21 


5.63 


20.1232 


HUVEC Mev 


25.54 


20.3 


5.25 


26.3692 


HAEC Vehicle 


25.7 


20.66 


5.04 


30.2903 


HAEC Mev 


27.84 


.22.41 


5.43 


23.1957 



Tissue Distribution pf Human 67102 

5 A human tissue panel was tested revealing highest expression of human 67102 

mRNA in normal kidney tissue and diseased artery (see Table 5, below). 



TABLES. ' 

Tissue Type , . Mean ft 2 Mean BB Ct Expression 



Aortic SMC 


28.34 


21.88' 


6.47 i 


11.2807 


Coronary SMC » 


29.98 


23.11 


6.88 


8.5196 


Huvec Static j 


27.55 


21.41" 


6.14 


14.18 


Huvec LSS 


27.72 


21.12 


6.59 


10.3444 


H/Adipose/MET 8 


30.56 


20.57 


9.99 


0.9834 


H/Artery/Normal/Carotid/CLN 595 


31.42 


20.3 


11.13 


0.4478 


H/Artery/Normal/Carotid/CLN 598 


32.23 


21.69 


10.54 


0.6717 


H/Artery/normal/NDR 352 


31.34 


.22.44 


8.9 


2.0933 


h/im Artery/ no rmai/AML* fo 


33.46 


23.98 


9.48 


1.4003 


H/Muscular Artery/Normal/AMC 236 


30.48 


23.52 


6.96 


8.0321 


H/Muscular Artery/Normal/AMC 247 


33.9 


24.07 


9.82 


1.1025 


H/Muscular Artery/Normal/AMC 254/ 


31.12 


23.43 


7.68 


4.8594 


H/Muscular Artery/Normal/AMC 259 


30.47 


23.07 


7.4 


5.9208 


H/Muscular Artery/Normal/AMC 261 


31.32 


22.92 


8.4 


2.9501 


H/Muscular Artery/Normal/AMC 275 


31.31 


24.78 


6.53 


10.8212 


H/Aorta/Diseased/PIT 732 


31.73 


22.76 


8.97 


1.9942 


H/Aorta/Diseased/PIT 710 


30.33 


23.36 


6.97 


7.9767 


H/Aorta/Diseased/PIT 711 


31.02 


23.3 


7.72 


4.7265 


H/Aorta/Diseased/PIT 712 


,30.57 


22.71 


7.86 


4.3043 


H/Artery/Diseased/iliac/NDR 753 


27.22 


20.07 


7.15 


7.041 


H/Artery/Diseased/Tibial/PIT 679 


32 


21.19 


10.81 


0.557 


H/Vein/Normal/SaphenousAMC 107 


31.57 


22.08 


9.49 


1.3859 


H/Vein/Normal/NDR 239 


33.44 


22.16 


11.28 


0.4021 


H/Vein/Normal/Saphehous/NDR 237 


31.32 


21.01 


10.31 


0.7877 


HA/ein/Normal/PIT 1010 


29.86 


22.36 


7.5 


5.5243 


HA/ein/Normal/AMC 191 


30.36 


22.53 


7.84 


4.3796 


HA/ein/Normal/AMC 130 


30.08 


22.32 


7.75 


4.6293 


HA/ein/Normal/AMC 188 


32.93 


25.01 


7.92 


4.129 


H/Vein/Normal/AMC 196 


32.24 


21.61 


10.64 


0.6288 


rWein/Normal/AMC211 


36.16 


23.51 


12.65 


0 


HA/ein^Jormal/AMC 214 


35.59 


24 


11.6 


0 


M/Artery/Diseased/CAR 1175 


29.73 


21.84 


7.89 


4.2011 


M/Aorta/Normal /543 


34.84 


29.17 


5.67 


19.6408 


M/Artery/Diseased/CAR 1174 


31.11 


26.59 


4.52 


43.5857 



147 

BNSDOCID: <WO_02055701 A2_l_> 



WO 02/055701 



PCTAJS01/49060 



M/Pancreas/PRI 2 


32.48 


26.33 


6.15 


14.082' 


M/Kidney/Normal/MPI 88 v , „ 


30.23 


26.84 • 


3.38 , 


96.0547 


M/Kidney/Normal/MPI 282 


29.34 


25.94 


3.4 


95.0612 


HUVEC Vehicle • • i 


29.25 s 


21.45 


7.8 , r 


4.4871 


HUVEC Mev 


28.16 


20.87 


7.29 


6.3899 


HAEC Vehicle 


28.14 


.21.16 


6.97 


7.9491 


HAECMev 


29.61 


22.66 


6.95 


8.088 



In addition, a human vessel panel was tested, which revealed high expression of 
human 67102 mRNA in normal artery, HUVEC, coronary smooth muscle cells, diseased 
5 aorta, and normal* hypothalamus (see Table, 6, below). 

TABLE 6. 



Tissue Type 


Mean 


(J 2 Mean 


aact 


Expression 


Artery normal 


27.32 


21.75 


5.57 


21.0505 


Aorta diseased 


28.27 


21.71 • 


6.55 


10.6353 


Vein normal 


30.38 


19:83 


10.55 


0.6ei93 


Coronary SMC 


28.61. 


22.23 


6.38 


12.0485 


HUVEC 


,26!32 


20.32 


6 


15.5709 


Hemangioma 


25.91 


19.07 


6.83 • 


8!7895 


Heart normal 1 


27.16 


19.98 


7.17 


6.9441 


Heart CHF 


27.2 


19.06 


8.14 


3.545 


Kidney 


25.54 


19.59 


5.96 


16.12 


Skeletal Muscle 


30.52 


21.5 


9.03 


1.9196 


Adipose normal 


30.11 


19.95 


10.15 


0.8771 


Pancreas 


29.57 


21.23 


8.33 


3.1076 


primary osteoblasts 


28.09 


19.85 


8.23 • 


3.3191 


Osteoclasts (diff) 


29.79 


17.02 


12.77 


0.1432 


Skin normal 


29.31 


21.41 


7.89 


4.2011 


Spinal cord normal 


28.3 


20.36 


7.93 


4.0863 


Brain Cortex normal 


28.25 


22.04 


6.21 


13.5084 


Brain Hypothalamus normal 


28.93 


21.49 


7.44 , 


5.7589 


Nerve 


28.34 


21.3 


7.04 


7.5989 


DRG (Dorsal Root Ganglion) 


29.16 ' 


21.11 


8.04 


3.7994 


Breast normal 


27.81 


20.47 


7.34 


6.1508 


Breast tumor 


29.08 


20.41 


8.68 


2.4466 


Ovary normal 


26.44 


19.7 


6.74 


9.3878 


Ovary Tumor 


30.93 


19.6 


11.34 


0.3871 


Prostate Normal 


28.11 


19.48 


8.63 


2.5241 


Prostate Tumor 


27.68 


19.68 


8 


3.9063 


Salivary glands 


28.9 


19.18 


9.71 


1.194 


Colon Tumor 


27.98 


18.82 


9.16 


1.742 


Lung normal 


26.96 


17.4 


9.56 


1.3202 


Lung tumor 


27.82 


19.64 


8.19 


3.4361 


Lung COPD 


26.38 


17.66 


8.72 


2.3633 


Colon IBD 


28.27 


17.29 


10.98 


0.4934 


Liver normal 


29.14 


19.58 


9.56 


1.3248 


Liver fibrosis 


29.89 


21.08 


8.8 


2.2358 


Spleen normal 


26.95 


19.09 


7.86 


4.3193 


Tonsil normal 


25.01 


16.8 


8.21 


3.3654 


Lymph node normal 


26.3 


18.22 


8.09 


3.6828 


Small intestine normal 


29.03 


19.59 


9.45 


1.4347 
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Skin-Decubitus 


27.66 


20.32 


7.34 


6.1722 


Synovium 


28.22 


19.23 


'8.98 


1.9804 


BM-MNC 


29.57 


18.46 


11.12 


0.4509 


Activated PBMC 


28.38 ' 


17.25 ' 


'11.14 


0.4447 


Neutrophils 


27.43 


18.4 


9.04 


1.8997 


Megakaryocytes 


26.72 


17.88 


8.84 


2.1822 


Erythroid 


31.52 


21.26 


10.26 


0.8183 


Colon normal 


30.07 


19.25 


10.82 


0.5551 



Tissue Distribution of Hump 44181 

5 ( A human vessel panel was tested revealing highest expression of human 44 1 8 1 

mRNA in LSS HUVEC (see Table 7, below). 



• TABLE 7 



Tissue Type 


Mean 


R 2 Mean 


aact 


Expression 


Static Huvec 


25.37 


19.18 


6.19 


13.697 


LSS Huvec 


25.7 


20.02 


5.68 


19.4377 


Aortic SMC 


28.75 


20.32 


8.43 


2.9095 


Coronary SMC 


28.52 


21.2 


7.31 , 


6.3019 


H/Adipose/MET 9 


36.07 


18.41 


17.66 


0 


Diseased Heart /PIT 1 


29.28 


21.15 


8.13 


3.5697 


H/Artery/Normal/Carotid/CLN 
595 


37.9 


18.32 


19.59 


0 


H/Artery/Normal/Carotid/CLN 
598 


39.97 


19.49 


20.48 


0 


H/Artery/normal/NDR 352 


40 


20.2 


19.8 


0 


H/Artery/Normal/AMC 150 


40 j 


22.27 


17.73 


0 


H/Artery/Normal/AMC 73 


40 


23.84 


16.16 


0 


IMA/AMC247 


39.73 


22.79 


16.95 


0 


IMA / AMC 254 


33.79 


22.23 


11.56 


0.3324 


IMA/AMC 259 


33.68 


21.12 


12.56 


0.1656 


IMA / AMC 261 


34.73 


21.23 


13.5 


0.0863 


IMA/AMC 275 


40 


24.52 


15,48 


0 


IMA /AMC 279 


30.89 


22.41 


8.48 


0 


H/Artery/Diseased/iliac/NDR 
753 


36.59 


18.43 


18.16 


0 


H/Artery/Diseased/Tibial/PIT 
679 


40 


19.84 


20.16 


0 


Aorta/Diseased/PIT 732 


34.74 


21.32 


13.41 


0.0916 


Aorta/Diseased/ PIT 710 


33.04 


22.48 


10.56 


0.6624 


Aorta/Diseased/P IT 711 


31.89 


22.09 


9.8 


1.1218 


Aorta/Diseased/ PIT 712 


32.92 


22.09 


10.84 


0.5474 


HA/ein/Normal/Saphenous/N 
DR 721 


32.66 


16.82 


15.83 


0.0172 


HA/ein/Normal/SaphenousA 
MC 107 


40 


20 


20 


0 


HA/ein/Nprmal/NDR 239 


40 


20.61 


19.39 


0 


HA/ein/Normal/Saphenou s/N 
DR 237 


40 


19.1 


20.9 


0 


HA/ein/Normal/NDR 235 


40 


21.34 


18.66 


0 
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HUVEC/Vehicle/24 hr 


30 5 


.1 v_>. I «_/ 


19 ^6 




HUVEC/Mev/24hr/1X 


30.28 


17.52 


12.76 


0.1442 


HUVEC/MEV/24HR/2.5X 


29.35 


,19.18 


10.18 


0.865 


HUVEC/MEV/48HR/1X 


35.68 


21.54 


14.14 


0 


HUVEC/MEV/48HR/2.5X 


34.7 


23 ' 


11,7 


0.3016 



1 ; • 

, Tissue Distribution of Human 67084 \ 

5 A humaa vessel panel was tested revealing highest expression of human 67084 

mRNA in HUVEC, LSS HUVEC, and coronary smooth muscle cells (see Table 8, below). 



TABLE 8. 



Tissue Type 


Mean 


& 2 Mean 


3d Ct 


Expression . 


Aortic SMC 


25.92 


19.23 


6.7 


9.6517 


Coronary SMC 


26.59 


20.36 


6.23 


13.3224 


Huvec Static 


23.39 


185 


4.88 


33.843 


Huvec LSS 


24.31 


18.32 


5.99 


15.7883 


H/Adipose/MET 9 


26.4 


18.46 


7.94 


4.0721 


H/Artery/Normal/Carotid/CLN 595 


26.83 


18-84 


8 


3.9198 


H/Artery/Normal/Carotid/CLN 598 


28.49 


20.16 


8.34 


3.0968 


H/Artery/normal/NDR 352 


27.12 


20.32 


6.8 


8.9432 


H/IM Artery/Normal/AMC 73 


31.48 


23.36 


8.12 


3.607 


H/Muscular Artery/Normal/AMC 236 


30.93 


23.56 


7.38 


6.0243 


H/Muscular Artery/Normal/AMC 247 


33,77 


24.84 


8.92 


2.0645 


H/Muscular Artery/Normal/AMC 254/ 


30.69 


23.68 


7 


7.7855 


H/Muscular Artery/Normal/AMC 259 


29.9 


22.12 


7.78 


4.5497 


H/Muscular Artery/Normal/AMC 261 


29.93 


21.13 


8.8 


2.2436 


H/Muscular Artery/Normal/AMC 275 


30.29 


22.97 


7.33 


6.2367 


H/Aorta/Diseased/PIT 732 


29.02 


21.35 


7.67 


4.8932 


H/Aorta/Diseased/PIT 710 


31.36 


22.8 


8.56 


2.6496 


H/Aorta/Diseased/PIT 71 1 


31.31 


22.6 


6.71 


2.3963 


H/Aorta/Diseased/PIT 712 


31.4 


22.48 


8.92 


2.0645 


H/Artery/Diseased/iliac/NDR 753 


25.37 


17.73 


7.64 


4.996 


H/Artery/Diseased/Tibial/PIT 679 


28.55 


19.45 


9.11 


1.816 


HA/ein/Normal/SaphenousAMC 107 


29.48 


.21.11 


8.38 


3.0121 


HA/ein/Norma!/Saphenous/NDR 237 


28.67 


19,86 


8.8 


2.2358 


HA/ein/Normal/PIT 1010 


28.31 


20.55 


7.76 


4.5973 


HA/ein/Normal/AMC 191 


29.25 


20.77 


8.47 


2.8104 


HA/ein/Normal/AMC 130 


28.32 


20.45 


7.88 


4.2598 


HA/ein/Normal/AMC 188 


31.68 


24.61 


7.06 


7.4943 


HA/ein/Normal/NDR 239 


35.65 


29.23 


6.42 


0 


HUVEC Vehicle 


26.86 


20.14 


6.71 


9.5188 


HUVEC Mev 


25.83 


18.52 


7.3 


6.3238 


HAEC Vehicle 


26.57 


19.64 


6.94 j 


8.1443 


HAEC Mev 


27.81 


21.13 


6.67 


9.7864 
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EXAMPLE 3: ^EXPRESSION OF RECOMBINANT 67076, 67102, 44181, 
67084FL, or 67084alt POLYPEPTIDE IN BACTERIAL CELLS 

• • • . 

In this example, human 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, • 
or 67084alt is expressed as a recombinant glutathione-S-transferase (GST) fusion 
polypeptide in R coli and f the fusion polypeptide is .isolated and characterized/ Specifically, 
8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 670&4alt is fused to GST 
and this fusion polypeptide is expressed in £. coli, e.g. , strain PEB 1 99. Expression of the 
GST-PLTR! fusion polypeptide iji PEB 199 is induced with IPTG. The recombinant fusion 
polypeptide is purified from crude bacterial lysates of the induced PEB 1 99 strain by affinity 
chromatography on glutathione beads. Using polyacrylamide gel electrophoretic analysis of 
the polypeptide purified from the bacterial lysates, the molecular t weight of the resultant 
fusion polypeptide is determined. 

i * - 

EXAMPLE 4,: EXPRESSION OF RECOMBINANT 8099, 46455, 54414, 

53763, 67076, 67102, 44181, 67084FL, or 67084alt 
POLYPEPTIDES IN COS CELLS 

To express the human 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, 
or 67084alt gene in COS cells, the pcDNA/Amp vector by Invitrogen Corporation (San 
Diego, CA) is used. This vector contains an SV40 origin of replication, an ampicillin 
resistance gene, an R coli replication origin, a CMV promoter followed by a polylinker 
region, and an SV40 intron and polyadenylation site. A £>NA fragment encoding the entire 
PLTR polypeptide and an HA tag (Wilson et al (1984) Cell 37:767) or a FLAG tag fused 
in-frame to its 3' end of the fragment is cloned into the polylinker region of the vector, 
thereby placing the expression of the recombinant polypeptide under the control of the CMV 
promoter. 

To construct the plasmid, the human 8099, 46455, 54414, 53763, 67076, 67102, 
44181, 67084FL, or 67084alt DNA sequence is amplified by PCR using two primers. The 
5' primer contains the restriction site of interest followed by approximately twenty 
nucleotides of the 8099, 46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt 
coding sequence starting from the initiation codon; the 3 ! end sequence contains 
complementary sequences to the other restriction site of interest, a translation stop codon, 
the HA tag or FLAG tag and the last 20 nucleotides of the 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt coding sequence. The PCR amplified fragment 
and the pCDNA/Amp vector are digested with the appropriate restriction enzymes and the 
vector is dephosphorylated using the CIAP enzyme (New England Biolabs, Beverly, MA). 
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Preferably the two restriction sites chosen are different so that the 8099, 46455, 54414, 
53763, 67076, 67102, 44181, 67084FL,or 67084alt gene is inserted in the correct 
orientation. ' The ligation mixture is transformed into & coli cells (strains HB101, DH5a> 
SURE, available from Stratagene Cloning Systems, La' Jolla, CA, can be used), the 
5 transformed culture is plated on ampicillin media plates, and resistant colonies are selected. 
Plasmid DNA is isolated from transformants and examined by restriction analysis for the 
presence of the correct fragment. 

COS cells are subsequently transfected with the human 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt -pcDNA/Amp plasmid DNA using the calcium 

10 phosphate or calcium chloride co-precipitation methods, DEAE-dextran-mediated 

transfection, lipofection, or electroporation. Other suitable methods for transfecting host 
cells can be found in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A 
, .Laboratory Manual 2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, 1989. The expression of the IC54420 

15 polypeptide is detected by radiolabelling ( 35 S-methionine or 35 S-cysteine available from 
NEN, Boston, MA, can be used) and immunoprecipitation (Harlow, E. and Lane, D. 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1 988) using an HA specific monoclonal antibody. Briefly, the cells are labeled 
for 8 hours with 35 S-methionine (or 35 S-cysteine). The culture media are then collected 

20 and the cells are lysed using detergents (RIPA buffer, 150 mM NaCl, 1% NP-40, 0.1% SDS, 
0.5% DOC, 50 mMTris, pH 7.5). Both the cell lysate and the culture media are precipitated 
with an HA specific monoclonal antibody. Precipitated polypeptides are then analyzed by 
SDS-PAGE. 

Alternatively, DNA containing the human 8099, 46455, 54414, 53763, 67076, 
25 67 1 02, 44 1 8 1 , 67084FL, or 67084alt coding sequence is cloned directly into the polylinker 
of the pCDNA/Amp vector using the appropriate restriction sites. The resulting plasmid is 
transfected into COS cells in the manner described above, and the expression of the 8099, 
46455, 54414, 53763, 67076, 67102, 44181, 67084FL, or 67084alt polypeptide is detected 
by radiolabelling and immunoprecipitation using a 8099, 46455, 54414, 53763, 67076, 
30 67102, 44181, 67084FL, or 67084alt -specific monoclonal antibody. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
35 described herein. Such equivalents are intended to be encompassed by the following claims. 
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What is claimed: 

1. An isolated nufcleic acid molecule selected from the group 'consisting of: 

5 (a) a nucleic acid molecule comprising the nucleotide sequence set forth in 

SEQ ID NO:l , SEQ ID NO:4, SEQ ID NO:7, SEQ ID NO: 10, or SEQ ID NO: 13, SEQ ID 
NO:16, SEQ ID NO:19, SEQ ID NO:22, or SEQ ID NO:25; and 

(b) a nucleic acid molecule comprising the nucleotide sequence set forth 'in 
SEQ ID NO:3, SEQ ID NO:6, SEQ ID NO:9, SEQ ID NO:12, or SEQ ID NO:15, SEQ ID 

10 NO: 1 8, SE0 ID NO:2i , SEQ ID, NO:24, or SEQ ID NO:27. 

2. ' An isolated nucleic acid molecule which encodes a polypeptide comprising 
the amino acid sequence set forth in SEQ ID NO:2^ SEQ ID NO:£, SEQ ID NO:8, SEQ ID 
NO:l 1, SEQ ID NO: 14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID 

15 NO:26. - 

' ' . i 

3. An isolated nucleic acid molecule comprising the nucleotide sequence 

contained in the plasmid deposited with ATGC® as Accession Number , , 

, ,or 



20 



25 



4. An isolated nucleic acid molecule which encodes a naturally occurring allelic 
variant of a polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2, 
SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1 , SEQ ID NO: 14, SEQ ID NO:17, SEQ ID 
NO:20, SEQ ID NO:23, or SEQ ID NO:26. 

■ - 

5. An isolated nucleic acid molecule selected from the group consisting of: 



a) a nucleic acid molecule comprising .-a nucleotide sequence which is at 
least 60% identical to the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NO:3, SEQ ID 

30 NO:4, SEQ ID NO:6, SEQ ID NO:?, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ 
ID NO:13, SEQ IDNO:15, SEQ IDNO:I6, SEQ ID NO:18, SEQ ID NO: 19, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27, or a complement 
thereof; 

b) a nucleic acid molecule comprising a fragment of at least 30 

35 nucleotides of a nucleic acid comprising the nucleotide sequence of SEQ ID NO: 1 , SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ 
ID NO:12, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO: 18, SEQ ID 
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NO:19, SEQ IDNO:21, SEQ IDNO:22, SEQ ID NO:24, SEQ IQ NO:25, SEQ ID NO:27, 
or a complement thereof; 

c) a nucleic acid molecule which fencodes a polypeptide comprising an 
amino acid sequence at least about 60% identical to the amino acid sequence of SEQ ID 

5 NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO: 11, SEQ ID NO:14, SEQ ID NO: 17, SEQ 
ID NO:20, SEQ ID NO:23, or SEQ ID NO:26; and 

d) a nucleic acid molecule which ^ encodes a fragment of a polypeptide 
comprising the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SfeQ 
ID NO: 11, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID 

10 NO:26, wherein the fragment comprises at least 10 contiguous amino acid residues of the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:5; SEQ ID NO:8, SEQ IDNOrl l, SEQ 
ID NO: 1 4, SEQ ID NO: 1 7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26. 

, -. 

6. An isolated nucleic acid molecule which hybridizes to a complement of the 
1 5 nucleic acid molecule of any .on'e of claims 1 , 2, 3, 4, or 5 unde^ stringent conditions. 

i . - ■ 

7. An isolated nucleic acid molecule comprising a nucleotide sequence which is 
complementary to the nucleotide sequence of the nucleic acid molecule of any one of claims 
1,2,3,4, or 5. 

20 • , 

8. An isolated nucleic acid molecule comprising the nucleic acid molecule of 
any one of claims 1, 2, 3, 4, or 5, and a nucleotide sequence encoding a heterologous 
polypeptide. 

25 9. A vector comprising the nucleic acid molecule of any one of claims 1 , 2, 3, 4, 

or 5. 

1 0. The vector of claim 9, which is ap expression vector. 

30 11. A host cell transfected with the expression vector of claim 10. 

12. A method of producing a polypeptide comprising culturing the'host cell of 
claim 1 1 in an appropriate culture medium to, thereby, produce the polypeptide. 

35 13. An isolated polypeptide selected from the group consisting of: 

a) a fragment of a polypeptide comprising the amino acid sequence of 
SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:l 1, SEQ ID NO: 14, SEQ ID 
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NO: 17, SEQ ID NO:20,.SEQ ID NO:23, or SEQ ID NO:26, wherein the fragment comprises 
•"' at least 10 contiguous amino acids of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID 
NO: 1 1 , SEQ ID NO: 1 4, SEQ ID NO:! 7, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID 
NO:26; , 
5 b) a naturally occurring allelic variant of a polypeptide comprising the. 

amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:ll, SEQ 
ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26, wherein the 
polypeptide is encoded by a nucleic acid molecule which hybridizes to a complement of a 
nucleic acid molecule consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
10 NO:6, SEQ ID NO:7, SEQ ID'NO:9, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, SEQ 
• ID NO: 1 5, SEQ ID NO: 1 6, SEQ ,ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:2 1 , SEQ ID ' 
NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27 under stringent conditions; 

c) a polypeptide which is encoded by a nucleic acid molecule 
comprising a nucleotide sequence which is at least 60% identical to a nucleic acid 

1 5 comprising the nucleotide sequence of SEQ ID NO: 1 , SEQ ID NQ:3, SEQ ID NO:4, SEQ 
IDNO:6, SEQ IDNO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ IDNO:12, SEQ ID NO:13, 
SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO: 1 9, SEQ ID NO:2 1 , SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:27; and 

d) a polypeptide comprising an amino acid sequence which is at least 
20 60% identical to the amino acid sequence of SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO:8, 

SEQ IDNO:ll, SEQIDNO:14, SEQ ID NO:17, SEQ IDNO:20, SEQ IDNO:23, or SEQ 
1DN0:26. 

14. The isolated polypeptide of claim 1 3 comprising the amino acid sequence of 
25 SEQ ID NO:2, SEQ ID NO:5, SEQ ID NO: 8, SEQ ID NO:l 1, SEQ ID NO:14, SEQ ID 

NO: 17, SEQ ID NO:20, SEQ ID NO:23, or SEQ ID NO:26. 

15. The polypeptide of claim 1 3 , further comprising heterologous amino acid 
sequences. 



30 



16. An antibody which selectively binds to a polypeptide of claim 1 3 . 



17. A method for detecting the presence of a polypeptide of claim 13 in a sample 
comprising: 

35 a) contacting the sample with a compound which selectively binds to the 

polypeptide; and 

b) determining whether the compound binds to the polypeptide in the 
sample to thereby detect the presence of a polypeptide of claim 13 in the sample. 
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18. The method of claim 1 7, wherein the compound which binds to the 
polypeptide is an antibody. .. -V 

5 19. A kit comprising a compound which selectively binds to a polypeptide of 

claim 13 and instructions for use. 

, i * " 

20. A method for detecting the presence of a nucleic acid molecule of any one of 

claims 1, 2, 3, 4, or 5 in a sample comprising: 

10 a) contacting the sample with a nucleic acid probe or primer which 

selectively hybridizes to a complement of the nucleic acid molecule; and 

b) determining whether the nucleic acid probe or primer binds to the 
, , complement of the nucleic acid molecule in the sample to thereby detect the presence of the 
nucleic acid molecule of any one of claims 1, 2, 3, 4, or 5 in the sample. 
15 , 

21. The method of claim 20, wherein the sample comprises mRNA molecules 
and is contacted with a nucleic acid probe. » 

22. A kit comprising a compound which selectively hybridizes to a complement 
20 of the nucleic acid molecule of any one of claims 1, 2, 3, 4, or 5 and instructions for use. 

23. A method for identifying a compound which binds to a polypeptide of claim 
13 comprising: 

a) contacting the polypeptide, or a cell expressing the polypeptide with a 
25 test compound; and 

b) determining whether the polypeptide binds to the test compound. 

24. The method of claim 23, wherein the binding of the test compound to the 
polypeptide is detected by a method selected from the group consisting of: 

30 a) detection of binding by direct detection of test compound/polypeptide 

, binding; 

b) detection of binding using a competition binding assay; and 

c) detection of binding using an assay for 8099, 46455, 54414, 53763, 
67076, 67102, 44181, 67084FL, or 67084alt activity. 

25. A method for modulating the activity of a polypeptide of claim 1 3 
comprising contacting the polypeptide or a cell expressing the polypeptide with a compound 
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which binds to the polypeptide in a sufficient concentration to mgdulate the activity of the 
polypeptide. 

• . . , • . • . • , r 

• • • * » i - 

26. 1 A method for identifying a compound which modulates the activity of a 

5 polypeptide of claim 13 comprising: 

a) contacting a polypeptide of claim 1 3 with a test compound; and . 

b) determining the effect of the test compound on the activity of |he 
polypeptide to thereby identify, a compound which modulates the activity of the polypeptide. 
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CCACGCGTCCGGCCTTCCGAAATXG^^ 

CTTCTTTAGCATGCTATTATQGGGAPJKTGte 

1 M V P V E N T E ' G P S h L N 14 

GGCTGTAACTTCTACGTGACC ATG. GTA CCT GTT GAA. AAC ACC dAG GGC CCC AGT CTG CTG AAC 4 2 

Q K G T A V E= T E G S G S R H P p W A R 34 

CAG AAG GGG ACA GCC GTG GAG ACG GAG QGC AGC GGC AGC CGG CAT CCT CCC TGG GCG AGA 102 

G C G M F T i F L S S V T ( A A V S G L L , V 54 

GGC TGC GGC ATG TTT ACC TTC I CTG TCA TCT GTC ACT GCT GOT GTC AGT GGC CTC CTG GTpG 162 

G Y E L »G I I S G A L L Q I K T L L A L 74 

GGT TAT GAAj CTT GGG ATC ATC TCT GGG GCT CTT CTT CAG ATC AAA ACC TTA TTA GCC CTG 222 

1 1 

S C H E O E M V V S S L VI G A L L A S 94 

AGC TGC CAT GAG CAG GAA ATG GTT GTG AGC TCC CTC GTC ATT GGA GCC CTC CTT GCC TCA . 282 

L T G G V L I D R Y G R R T A I I L S * S 114 

CTC ACC GGA GGG GTC CTG ATA GAC AGA TAT GGA . AGA AGG ACA GCA ATC ATC TTG TCA TCC }42 

C L L G L G S L V X, 1 1 L» S L S Y T V L I 134 

TGC CTG CTT GGA CTC GGA AGC TTA; GTC TTG ATC CTC AGT TTA TCC TAC ACG GTT CTT ATA 402 

I ♦ 

V G R I A I G- V S I S b S . S ♦ X A T , C V Y 154 
GTG GGA CGC AT^T GCC ATA GGG GTC TCC ATC TCC CTC TCT TCC ATT GCC ACT TCT GTT TAC 462 

I A E I A P Q H R • R G h L, V S L N E L M 174 

ATC GCA GAG ATT GCT CCT CAA CAC AGA AGA GGC CTT CTT GTG TCA CTG AAT GAG CTG ATG 522 

I V I G I L S A „Y ... I S N Y A F A N V F H 194 

ATT GTC ATC GGC ATT CTT TCT GCC TAT ATT TCA AAT TAC GCA TTT GCC AAT GTT TTC CAT 582 

G W K Y M ? Q Xj V I . P L G . V L Q A* I A M 214 

GGC TGG AAG TAC ATG TTT GOT CTT GTG ATT CCC TTG GGA GTT TTG CAA GCA ATT GCA ATG 642 

Y F L P P S P R F L V M K G Q E G A A S 234 
TAT TTT CTT CCT CCA AGC CCT CGG TTT CTG GTG ATG AAA GGA CAA GAG GGA GCT GCT AGC 702 

K V L G R L R A L S D T T E E L T V I K 254 

AAG GTT CTT GGA AGG TTA AGA GCA CTC TCA GAT ACA ACT GAG GAA CTC ACT GTG ATC AAA 762 

SSLKDEY QY S FW DIi FRSKDN 274 

TCC TCC CTG AAA GAT GAA TAT CAG TAC AGT TTT TGG GAT CTG TTT CGT TCA AAA GAC AAC 822 

M R T R I M I G L T L V F F V Q I T G Q 294 

ATG CGG ACC CGA ATA ATG ATA GGA CTA ACA CTA GTA TTT TTT GTA CAA ATC ACT GGC CAA 882 

P N I L F Y A S T V L K S V G F Q S N E 314 

CCA AAC ATA TTG TTC TAT GCA TCA ACT GTT TTG AAG TCA GTT GGA TTT CAA AGC AAT GAG 942 

A A S L A S T G V G V V K V I S T I P A 334 

GCA GCT AGC CTC GCC TCC ACT GGG GTT GGA GTC GTC AAG GTC ATT AGC ACC ATC CCT GCC 1002 



Figure 1A 
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TL LVD VGSK TFLC IGSCV M 354 
ACT CTT CTT GTA GAC QHI GGC AGC AAA ACA TTC CTC TG TT GGfl ffCC TCT GTG ATG 1062 

.A A S L V T M G.I V N L M I H M H F T H 374 

GCA GCT TCG TTG GTG ACC ATG GGC ATC GTA AAT CTC AAC ATC . CAC ATG AAC TTC ACC CAT 1122 ' 

I C R S H.N S I N Q S L D P S V I Y G P 394 

ATC TGC AGA AGC CAC AAT TCT ATC AAC CAG TCC TTG GAT GAG TCT GTG ATT TAT GGA CCA 1182 

G N L S T.N N N T L R D H F K G I S S H 414 

GGA AAC CTG TCA ACC AAC AAC AAT ACT CTC AGA GAC CAC TTC AAA GGG ATT TCT TCC CAT 1242 

S R SSL M P L R N JD V D K R G E T T S 434 

AGC AGA AGC TCA CTC ATG CCC CTG AGA AAT GAT GTG GAT AAG AGA C6G GAG ACG ACC TCA 13 02 

• " I 1 

A S L L N A G L S H T E Y O I V T D P G 454 

GCA TCC TTG CTA AAT GCT GGA TTA AGC CAC ACT GAA TAC CAG ATA GTC ACA GAC CCT , GGG 13 62 

D V P A F L K W I L S L A S.L. L V Y V A A 474 

GAC GTC CCA GCT TTT TTG AAA TGG CTG TCC TTA GCC AGC TTG CTT GTT TAT* GTT GCT GCT 14 2^ 

P S I G L G P M P W L V L S E I F P G G 494 

TTT TCA ATT GGT CTA GGA CCA ATG CCC TGG CTG GTG CTC AGC GAG ATC TTT CCT GGT GGG 1482 

I , R G R A M A L T S S M N W G I N h L I 514 

ATC AGA GGA CGA GCC ATG GCT TTA ACT TCT AGC ATG AAC TGG GGC ATC AAT CTC CTC ATC 1542 

S L T F L T V T D L» I G L P W V C F I Y 534 

TCG CTG ACA TTT TTG ACT GTA ACT GAT CTT ATT GGC ' CTG CCA TGG GTG TGC TTT ATA TAT 1602 

T.I M S L A S h L F V V M F I PET K G . 554 

ACA ATC ATG AGT CTA GCA TCC CTG CTT TTT GTT GTT ATG TTT AJTA CCT GAG ACA AAG GGA 1662 

C S L E Q I S M E L» A K V N Y V K N N I 574 

TGC TCT TTG GAA CAA ATA TCA ATG GAG CTA GCA AAA GTG AAC TAT GTG AAA AAC AAC ATT 1722 

C F M S H H Q E E L V P K Q P Q K R K P 594 

TGT TTT ATG AGT CAT CAC CAA GAA GAA TTA GTG CCA AAA CAG CCT CAA AAA AGA AAA CCC 1782 

Q E Q L L E C N K L. C G R G Q S R Q L S 614 

CAG GAG CAG CTC TTG GAG TGT AAC AAG CTG TGT GGT AGG GGC CAA TCC AGG CAG CTT TCT 1842 

PET* 618 

CCA GAG ACC TAA 1854 

TGGCCT^AACACCTTCTCAACGTGGATAGTGCCAGAACACTTAGGAGGGTC 

CTGTGCTCTCTTTTCAGTGTCATGGAACTGG 

CTCCCCAGAAGGAACCTCAAAAGGTAGATGA GGTACAAGGTC CTAAGTGATCTCTTTTTCTGAGCAGGATATC AGGTTA 
AAAAAAAAAAGTTACKKSCTGGTTTAAT^ 

AGACATCAACCTCCGCCTTAAGCTATGTATGTATGGAGGCCAGTCGCAGCTTTATTATGCA 

ACATGAGGGTACAGTTTCTGCCTACCAAGACACTACH*TGCACTGGATCTTAC 

GGACAACTGCCCATATATTCTATCTAGATTAGGAGAGGGTCC^^ 

ACAAGTATAAAGATTATAGAGCTTATTTTATGTVACTATAAACTATAATTTAATGCAAT^ 
GTTAATATTGTGAAATATTAAAATAATTCCGCAATAAAAAAAAAAAAAAAGGGCGGCCGC 
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Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 

HMMER jls freely distributed under the GNU General Public License (GPL) . 



HMM file: 
Sequence f ile : 



/prod/ddm/seqanal/PFAM/pfam6 .4/Pfam . 
/prod/ddm/wspace/orf ana 1/oa- script . 8089 . seq 



Query: 



8099 



Scores for sequence family classification (spore includes all domains): 
Model 



Description 
Sugar (and other) transporter 



sugar_tr 
1 

, FecCD_family FecCD transport family 

1» . 

MCT Monbcarboxylate. transporter 

1 

Parsed for domains: 



Score 

318.2 
-218.2 
-235.8 



E-value N 
9.6e-92 
6.9 
2.7 



Model 


Domain 


seq-f 


seq-t 


hmm- f 


hmm- 1 




score 


E-value 


FecCD_f ami 1 y 


1/1 


26 


227 . 


1 


' 311 


U 


-218.2 


6.9 


sugar_tr 


1/1 


43 


564 . 


. 1 


488 


u 


318.2 


9.6e-92 


MCT 


1/1 


29 


567 . 


1 


611 


u 


-235.8 


2.7 



Alignments of top-scoring domains: 

FecCD_family: domain 1 of 1, from 26 to 227 : score -218. 2, E = 6.9 

* - >Gal s i spadvlqa 1 f ggg tege ievde 1 i iwdl t 1 rRLPRvLlAlLV 
G+++ ++a ++ + ++ ++ + +1LV 

8099 26 GSRHPPWARGCGMFT FLSS VTAA- VSGLLV 54 

GAaliAVaGAilQgltRNPiAsPgilGinsGAslgwlaivlfpgglsisa 
G + lGi sGA 1 + ++1 + + + + 

8099 55 G - - - YELG 1 I SG ALLQ I KTLLALSCH- EQEMV 82 

lyllpsf Af aGaliaallVyllawkgrnglspvrLiLaGialsalf sAlt 
» +1+++A++ +1++++1+ +++ + i+ls+++ +1 

8099 83 VSSLVIGALLASLTGGVLIDRYGRR---- TAIILSSCLLGLG 120 



8099 



tllllisddlqdqqalfWltGSlsgrnWedvklalpilliglplalllar 
+1 1+ls + +++ + gr v + 1 ++ + +a + 

121 SDVLILSLS YTVL - 1 VGRIAIGVS ISLSS IATCVYIAEI - - - 



158 



qLnvLsLGddt AkgLGvnvervR .1111 1 1 wlLtGaaVAvAGp IgFVGL 
+ + + R+11++1 +++ '+G+ 
8099 159 APQHRRgLLVS LNELMI V IGI 179 

ivPHiaRrLvGt . dhrwLLPaS ALlGAiLLl 1AD1 1 AR 1 1 f aPiEl PvGi 
+ +i h w ++++ + +P G+ 

8099 180 LSAYISNYAFANvFHGW : - KYMFGLVIPLGV 208 



8099 



vTAliGaPyFl YLLrr<- 

+ A+ a+yFl + + + + ++L-t>++ 
209 LQAI - -AMYFLppspr FLVMK 



227 



sugar_tr: domain 1 of 1, from 43 to 564 : score 318.2, E = 9.6e-92 

* - >valvaalgGgf If GyDtgviggf lalidf If rfglltssgalaelvg 
+++aa+ G. +1 Gy +g+i+g+l +i+ 1 s+ ++ 
8099 43 SSVTAAVSG- LLVGYELGI ISGALLQIK TLLALS CHEQE 80 
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ystvltglwsif f lGrliGslfaGklgdrfGRkksllialvlfviGall 
+ws + + +G+1 + si +G l+dr+GR+ .+ + + +.+ + + 1 . G+1 + 
81 MWS S L V I GALLAS LTGG VL I DR YGRRTA 1 1 LS S CLLG LGS LV 123 

sgaapgytTiGlwafyllivtfRvlvGlgvGgasvlvPmYisEiAPkalRG 
+++ ++ +livGR+ +G ++ 4 s ++ +Yi+EiAP + RG 

124 LILSL SYTVLIVGRIAIGVSISLSSIATCVYIAEIAPQHRRG 16*5 



algslyqlaitiGilvAaiiglglnktnndsalnswgWRiplglqlvpal 
l+sl++l+i+iGil A+i+++++++++ gW+ ++gl + ■*■++ 

•8099 166 LLVSLNELMIVIGILSAYJSNYAFANVFH---- GWKYMFGLVI PLGV 208 

llligllf lPESPRwLvekgkleeA3fevLaklrgvedvdqeiqeikaele 
1++1++ flP SPR+Lv+kg++ A +vL +lr +d+++e+ ik +1+ 
8099 209 LQAIAMYFLPPSPRFLVMKGQEGAASKVLGRLRALSDTTEEIiTVIKSSLK 258 

atvseekagkaswgelfrgrtrpkvrqrllm^r^lqafc^ltGiNaifYY 
+ + +++ + +lfr++ +r r*+++g +1 +f Q tG i +Y 

8099 259 DEYQYS - -'-FWDLFRSKD- -NMRTRIMIGLTLVFFVQITGQPNILFY 300 



8099 



sptif ksvGvsdsvasllvtiivgvvNfvf TfvaLif lvDrf GRR 

++t++ksvG+++ a 1++ +vgw +++T++a + lvD+ G ++ 
301 ASTVLKS VG FQSNEAAS LAS TG VG WKVI ST I PA- TLL VDHVGS K t f 1 c i 349 



8099 



8099 



8099 



8099 



8099 



+++ -++ + +++++ +++++ +++ +++ + + + + ++ 

350 gssvmaaslvtmgivnlnihmnf thicrshnsinqsldesviygpgnlst 399 

; .pllllGaagmaicf lilgasi 

++++ +++ ++ +++++++ + +++ +++ +a+ + +++ 

400 nnntlrdhfkgisshsrsslmplrndvdkRGETTSASLLNAGLSHT 44 5 

gvallllnkpkdpsskaagivaivf illf iaf FaigwGpipwvilsElFP 
+ ++ p + + + ++++ +l+++a+F++g Gp+pw++lsE+FP 
446 EYQI VTDP - GDVPAFLKWLSLAS LLVYVAAFS IGLGPMPWLVLS E I FP 4 92 

tkvRskalalataanwlanf iigf If pyitgaiglalggyvf lvf agllv 
++R++a+al+ ++nw +n +i+++f ++t + ig ++v ++++++ + 
493 GGIRGRAMALTSSMNWGINLLISLTFLTVTDLIG- - -LPWVCFIYTIMSL 539 

If ilf vf f f VPETkGrtLEeieelf <- * 
++ Ifv +f +PETkG +LE+i+ + 
540 ASLLFVVMF I PETKGCSLEQ I SMEL 564 



MCT : domain 1 of 1, from 29 to 567: score -23 5.8, E = 2.7 

*->kpPDGGwGWvWfasFlingfvdGf iks fGvf f sellqeet 

( pP W+ +*F + +v +++ + + G++ llq t 

8099 29 HPP-- WARGCGMFTFLSSVTAAVSGl lvgyeLGI ISGALLQIKT 70 

ifnesksdvdtAwIgSimlavIlfsGPls.SilvnrfGcRivmiaGglla 
1 s ♦ + + + +S + + * 11+s 1 +++1 +r+G R+ +i+ ++1 
8099 71 LLALS CHE - QEMWS S LV I GALLAS - - LTgG VL I DR YGRRTAI I LS S CLL 117 

gaGlllasFstniwelyltfGvitGlGfgf if qPaivilgqYF . eKrRsl 
t g+G 1+ + s ++ +1++ + +G+ ++++ + + v +++ ++ rR+1 

8099 118 GLGS L VL I LSLS YTVL I VGR - 1 AIG VS I S LSS I ATCVY I AE I ApQHRRGL 166 

AtGiAvaGsGvGtwfppllqf lidny . .GsDWrgal. . .lilggillnc 
++ +G+ + + + ++ + n+ +G W++ ++ -*- lg++ + + 

8099 167 LVSLNELMIVIGI-LSAYISNYAFANVfhG-- WKYMFglvIPLGVLQAI- 212 

vicGalllRPlepsvpqdekdkeqetlkeakkkkendtettkeeteplks 
a++ P ++++ +++ + a++k+ + + te+l++ 
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- PS PRFLVMKGQEGAAS KVLGRLRALSpTTEELTV 252 



8099 



8099 



80919 



8099 



8099 



8099 



rp^kasvlklQdakaersvds^lsskSvgerdksqlsekqksqasgrpsss 
253 I-r r ------- --i.---^—- 2S3 



i atavqlvllrsrlekadlpilcrvrvsrrrvlSkVS^aeSgtdgersSgylN 
+s+l " ' 
8099 254 ---- KSSL 



257 



rkdvFYtGsisNvaefkedpdkYrssslhgtrttvgnaesqstlrlddsr 



esgdgdsssedlsektrgdggkkessskeiretikkllDf svlknrtFll 

k ++ +++++ ++ + + + r ++ 
258 ---- KDEYQYSFWDLFR SKDNMRTRIMIG 282 

yaisnlf aslGf fvPlvf LvsYaikslgldekeAsf Llsi . iGvsnivGR 
+++ +++ G 1++ + ks g ++eA+ L s+++Gv+ +++ 
383 LTLVFFVQ ITGQPN I LFYASTVL - KS VGFQSNEAASLASTgVG WKVI ST 3 31 

pifGlvADkkgvrpTarhivyifnlsllalGlttlacPla'tsfwgLwyc 
+ ++1+ D+ g ++ + +s++a+ 1 t+ + + ++ +c 

332 IPATLLVDHVG^KT FLCIGSSVMAASLVTMGIVNLNIHMNFTHIC 376 

ilFGfs 

+++++ +++ ' ++++ ++++++ +++ ++ +++++++ + +4+ 
377 RSHNSInqsldesviygpgnlstnnntlrdhfkgisshsrsslmplmdv 426 



++++++++ + + + ++++ + +++++ + + + + + 
8099 427 dkrgettsasllnaglshteyqivtdpgdvpaf lkwlslasllvyvaafs 476 

- .' iGsygaLtfwLvdLvg 

+ ++ + ++ +++ ++ + +++ + +i ++Ltf +++dL+g 
8099 4 77 iglgpmpwlvlseifpggirgramaltsstnnwglNLLISLTFLTVTDLIG 526 

WlekfsnAfGllllfeGvavLvGPPiaGlLvDakttgdYtvaFyfsGill 
+ + ++++++ +a+L ++ + . + t+ + ++ + 
8099 527 LPWVCFIYTIMSLASLLFVVMF-- -IPE- -TKGCSLEQIS M 562 

llsgl<-* 
1+ + 

8099 563 ELAKV 567 
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LUSTAL W (1.74) multiple sequence alignment 



bh8099FL 
37021|GALP 

bh8099FL 
3 7021 | GALP 



bh8099FL 
3 7021 | GALP 



bh8099FL 
3 7021 I GALP 



oh8099FL 
37021 | GALP 



Dh8099FL 
37021 | GALP 



oh8099FL 
3702l|GAliP 



3h8099FL 
*702l|GALP 



3h8099FL 
17021 | GALP 



)h8099FL 
17021 j GALP 

>h8099FL 
17021 {GALP 



, ^Y^if?%J?^^^ ^^^O^^TAVETEGSGSI^PPWARGCGMFTFLSSVTAAVSGLLVGYPT n 
- - TRANS ~ -RTERMPDAKKQG- - -/-RSNKAMTFFVCFLAALAGLLFGLDIG 

• :: - .:.:*• :**: ... **::***..* ..* 

ISGALLQI KTLLAliSCHEQEMVVS SLVI GALLAS LTGGVL I DRYGRRTAI ILSSCT irr 

f AG ** PF ^ EFQITS J T Jf^ SS ™ FG ^ 

t * :s * ***■****:::,** :.::.** :**:.:::• * • 

SLVLILSLSYTVLIVGRIAIGVSISLSSIATCVYIAErAPQHRRGLLVSLNELM'lVTGT 

SLFSAAAPNVEVLILSRVIjLGLAVGVASYTAPLYLSEIAPEKIRGSMISMYQLMITIGI 
: . • ■***■■*'■■ :*:::.::* ::. : * : : : . 

SAYISl^AFANVFHGWKYMFGLVIPLGVLQAIAMYFLPPSPRPLVMKGQEGAASKVLGR 

GAYLSDTAFS - YTGAWRWMIK3VI 1 1 PAILLLIGVFFLPDSPRWFAAK^F^ERvfSS 
.**•:*: **; ..*..*.*...* ..* *...******.._ * . ^ . + 

■ RALSDTTE- ELTVIKSSLKDEYQYSFWDLFRSKDNMRTRIMIGLTLVFFVOITGOPMTT 
RDTSAEAKRELDEIRESLQVK-Q-SGWALFKENSNFRRAVFLG^ 

I * 

YASTVLKSVGFQSNEAASIASTGVC^KVISTIPATLLVDHVGSKTFLCIGSSVMAASr.^ 

YAPKI FE^GYTNTTEQMWGTVIVGLTNVIATFIAIGLVDRWGRKPTLTLGFLVMAAG - 
**..:::*.*: . : . **...*..*. * ,***. * * * . * **** 

TMGIVNLNIHIWFTHICRSHNSI^ 

-MGVLGTMMHI — • qi_^jjS 

MPLRNDVDKRGETTSASLLNAGLSHTEYQI VTDPGDVPAFLIOT 

_p SA 7" " Q Y FA I AMLLM F I VGFAMS J 

GPMPWLVLSE IFPGGl RGRAMALTS SMNWG INLLI SLTFLTVTDLIGLPWVCFIYTIMS1 
GPLIV^CSEIQPLKGRDFGITCSTATIW^ 

...... ** *••?_ ★ ***. . . * . 



★ * . * . - *** * * 



ASLLFVVMFIPETKGCSLEQISMEIJ\KVlTY\n<KMICFMSHHQ 

LFILLTLWLVPETKHVSLEHIERNLMKGRKLR- - -EIGAHD- - - - - - - - - - - . 

:*:.: ::**** ***:*. : * * ■ . . ' : : * - 

ECNKLCGRGQSRQLS PET 
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h8 0 9 9 FL MVP VENTEG PS LLNQKGTA VETBGSGS RH P PWARGCG - MFTFLSS VTAA VSGLLVGYELG 

9830 | ARAE - - TRANS --P-- RTERMVTTNfTB^ALT - PRSLRDTRRMNMFVS - VAAAVAGLLFGLDIG 

h8099FL IISGALLQIKTLLALSCHEQEMWSSLVIGA^ 
983 0 | ARAE VIAGALPFITDHFVLTSRLQEVAAPSSMMI^ 

, * * ** :.:* . * * * **: : • - -"*.'*" 

18099FL GSLVTj I LSLS YJVLI VGR I AIGVS ISLSS I ATCVY I AEI APQHRRGLLVSLNELMI VIGI 

983 0 | ARAE G S I G S A FAT S VEM L I AAR WLG I AVG I A S YTA P L Y US EMAS ENVRG KM I SM YQ LMVTLG I 

18 0 9 9 FL LSA Y I SNYAFANVFfIG - WKYMFG - LVI P LG VLQA I AMY F,L P P S PRFLVMKGQEGAAS KVL 

983 0 | ARAE VLAFLSDTAFS - - YSGNWRAMLGVLALP- AVI^LI ILWFLPNS PRWLAEKGRH I EAEEVL 

. . ■* * . *.* * . * *** * . * * w * * * . * **. * J** 

I .a • • • « • • • • • • • ■ ■ m m 

18099FL GRLRALSDTT-EELTVIKSSLraEYQYSFWDLFRSKIDNMRTR 

9830 | ARAE RMLRDTSEKAREELNE I RESLKLK - Q - GGWALFKINRNVRRAVFLGMLLQAMQQFTGMNI 

*★ * . . * ** * • ** * - * * * * : : * ; * : : : * - * • * - * * 



18099FL 
9830|ARAE 



18099FL 
9830 | ARAE 



I LFYASTVLKSVGFQSNEAAS LASTGVGVVKVTSTI PATLLVDHVGSKTFLC IGSS VMAA 
IMYYAPRIFKMAGFTTTEQQMIATLWGLTFMFATFIi^ 

::* . ** :.* *.: **:.* *. * ** **** 

S L VTMG I VNLN I HMN FTH I CR SHNS I NQS LDE S V I YG PGNL S TNNNTJLRDHFKG I S SHS R 
GTLVLG- - - - YC — 



18099FL 
>830|ARAE 



SSLMPLRNDVDKRGETTSASLLNAGLSHTEYQIVTDPGD^ 

- - LMQFDN G-TASS • GLS WLS VGMTMMC I AGYA 



18099FL IGI^PMPWLVLSEIFPGGIRGRAMALTSSMNWGINUjISLTFLTVTDL 
) 8 3 0 | ARAE MSAAPWWI LCSE I QPLKCRDFG I TCSTTTNWVSNMI IGATFLTLLDS I GAAGTFWLYTA 



18 0 9 9FL MSLASLLFWMFI PETKGCSLEQI SMELAKVNYVKNNICFMSHHQEELVPKQPQKRKPQE 

>8 3 0 I ARAE LNIAFVGITFWLI PETKNVTLEH I ERKLMAGEKLRN - IGV- - - - - - - : 

- .*" .--•***** = **2* z* : * . 



18099FL 
)830 | ARAE 



QLLECNKLCGRGQSRQLS PET 
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09830 I ARAE 
3702,1 J GALP 
bh8099FL 



09830|ARAE 
37021 (GALP 
bh8099FL 



1983 0 [ARAE 
3 7021 J GALP 
3h8099FL 



)9830|ARAE 
17021 [pALP 
>h8099FD 



>9830 I ARAE 
.7021 JGALP 
>h8099FL 



9830 1 ARAE 
7021 | GALP 
•h8099FL 



9830 lARAE 
7021 | GALP 
H8099FL 



9830 | ARAE 
7021 | GALP 
h8099FL 



9830 I ARAE 
7021 JGALP 
;i8099FL 



3830|ARAE 
7021 | GALP 
18099FL 



>830 lARAE 
'021 J GALP 
18099FL 



TRANS PRTERMPDAKKQG - - - - - — - - RSfSSpS^ GLDl G^ 

MVPVE^EGPSI^QKGTAVE^ 

. . » w . *■ • ■• * . 



iJSf 2? P H ^ TS ^ E ^ SS ^^^^GALFNGWLSFRLGRKY, 




SMYQLMVTLG I V 



* • i * - * * 



RLRALSDTT 



~ ^ * * * k^^E YQ YS F WDLFRS KDNMRTR I M I GLTL VF FVQ I TGQPN I 



^APRIFKMAGFTTTEQQMIATLWGLTFMFATFIAVFTVDKAGRICPaTTrT^c.0,^. 
MYYAPKIFEIAGYTin'TEQMWGTVIVGIiTNVIiATFIAIG 

LFYASTVLKS VGFQSNEAASIASTGVGVVKVI ST!r PATLL^^GSIOT?LCIGSS\^AAS 



rwgrkptltlg flvmaag 

HVGSKT 



TLVLGYCLMQFDN 

MGVLG-TMMHI ' GTASSG- - 

^^T^* ^ ^ HMNFTH I CRS HNS I NQSIjDES VI YGPG^^STWNITOL^HFKG I S S HS RS 

--"III III " LS WLS VGMTMMC I AG YAM 

SLMPLRNDVDKRGE^TSASLliNAGL^HTEYQI 

■ « • • • s • 5 

SJS'SlS'Sf iiSS" - lk ^ fgitcst ^^snmi.igatfltlldsigaagtfwlyt 

. . . :: *** * :: *....*.* ** **** : . .* 

ALNI AFVGI TFWL I PETKNVTLEHIERKLMAGEKLRN IGV---- 

ALNVLFILLTLWLVPETKHVSLEHIERNLMKGRKLRE- - --IGAHD- - " 
IMSLASLLFVVMFIPETKGCSLEQISMELAJCVNYVKNNICFMSHHQEELVPKQ 



EQLLECNKLCGRGQSRQLSPET 
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>h8099FL 
)2168Patent 



, MVPVENTEGPSLLNQKGTAVETEGSG — - SRHPP WARG - CGMPTFLSS VTAAVSGLLVGY 

MTSDHEHMTAVCASHVQTHGSQLQIQKLSPCFRPPTPAFRISSS 1 1 LLGAG - LAGP 

. .'*.*** •-'■** * * ^ ** . • •**•* 

• • •••• • • • • » m ^ 



>h8099FL 
)2168Patent 



)h8099FL 
>2168Patent 



EI/5IISGAJjLQIKTLLALSGHEQEMWSSLVIGALIJVSLTGGVLIDRYGRRTAIILSSCL 
STGDRWFGVSWGTGLFLPPLQLLLPPRLLFTHAILERLHLWLALPPVLVLGHMiLH-CK 



t : * * * 

l 



* i 



* * . * * 



★ ■* 



LGLGSLVLILSLSYTVLIVGRIAIGVSISLSSIATCVYIAEIAPQH- -rrgllvslnelm 
vggstaragdqlvqrvlll-ivflhrwvqwpzgtevdilgmgsrtggrrgpelrp G 

. * . , * **..... **•*•.. . * * * 



)h8099FL 
)2168Patent 



IVIGIIjSAYISNYAFANVFHGWKYMFGLVIPLGVLQAIAMYFLPPSPRFLVMKGQEGAAS 

frisilsayisnyafanvfhgwkymfglvipi^vlqaiamyflppsprflvmkgqegaas 

* ************************************** 



>h8099FL 
>2168Patent 



KVLGRLRALSDTTEELTVIKSSIjKD^ 

kvlgrlralsdtteeltviksslkdeyqysfwdlfrskdnmrtri^ 

* * * ************************************ * * *********** * * ****** 



>h8099FL 
21€8Patent 



PNILFYASTVLKS VGFQSNEAASLASTGVGWKVIS 1 1 PATLLVDHVGSKTFLCI GSSVM 

PH I LF YA3TVL KS VGFQ SNEAAS LAS TGVGWKV I S T I PATLLVDHVG S KT FLiC I G 

******************************************************** 



»h8099FL 
2168Patent 



AASLVTMGIVNLNIHMNFTHICRSHNSINQSLDESVIYGPGNLSI^^ 



»h8099FL SRSSLMPLROT^VDKRGETTSASLI^A 

2 16 8Patent -LLNAGLSHTEYQIVTDPGDVPAFLKWL.SLASLLVYVAA 

******************** ***** ****** ******* 

h8099FL FS IGLGPMPWLVLSEI FPGGI RGRAMALTS SMNWGINLLI SLTFLTVTDL IGLPWVCFI Y 

2168Patent FS IGLGPMPWLVLSEI FPGGI RGRAMAliTS SMNWGINLLI SLTFLTVN -L IGLPWVCFI Y 

******* ****************** ******* *************** . *********** 

» 

hBO 99FL TIMSIASLLFVVMFIPETKGCSLEQISMELAKVNYV^^ 

2 168Patent TIMSLASLLFVVMFIPETKGCSLEQISMELAKVNYVKNNICFMSIffl 

************************************************************ 

h8099FL QEQLLECNKLCGRGQSRQLSPET 
2168Patent QEQLLECNKLCGRGQSRQLSPET . 

*********************** 
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Input file Fbh464S5PL.se9output File Fbh46455FL. era | 
Sequence length 2230 • > 

GTCGACCCACGOSTCCGGCAACATGGCGGCTGCCGTGGTGGAGCfKrCCGqqCTGA 

TACCCCGGGTGAGG«nx^CTCCGCCnX3GGATCGl^CCTCTTC ' 

GCACGTCCCCTCCCX:GCTGTCrrGTCTACTGAGACGtXMAGGCGTGACAGGGCCCGa3TC^ 

GCTTCAGGGCAAGCTCCCCGTCTCCGGQCGCACTTCCCTCGC^ 

CCTCGCAGGTGGGATCGTCGGTGGGACCGGAGCGCGGGCGGGCGCGGCCpCCCGGGACCl ATG GCC GGG TCC GAC 15 

T A P F. L S O A D D P - D D G P V P G r o -> = 

ACC GCG CCC TTC CTC AGC CAG GCG GAT GAC CCG GAC GAC GGG CCA GTG CCT GGC ACC CCG l\ 

G L P G s T G N P K S E E . P' E V P n o n- 
GGG , TTG CCA GGG. TCC ACG GGG AAC CCG, AAG TCC GAG GAG CCC GAG GTC CCG GAC CAG GAG 13 5 ' 

G L Q R I V G Li S P G R S A L I V A V t *c 
GGG CTG CAG CGC ATC ACC GGC CTG TCT CCC GGC CGT TCG GCT CTC ATA GTG GCG GTG CTC 195 
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G E T P P C L P rt - G t D S C , S S a ^ - *~d "*j 3 t ; 
GGG GAG ACC CCA CCC TGC CTT CCC GGA GAC TCC TGC TCT TCC TCT GAC AGT CTC 

G L I T C L .' T • G IV L G V G L. G V E I r 
GGA CTC ATC ACC TGC CTG ACC GGA GTC CTG GGT.GTG GGC CTG GGT GTG GAG ATC AGC CGC 

R L R H S N P R A D P L ■ V. C A T G L L 



I F J . C E T L L S M N W A I 



L Y V JV r P , T R R S T A E A F * Q 



N W P P S F L S E F R A L Q F S L 

\AC TGG CCC CCC TCC TTC TTG TCC GAG TTC CGG GCT CTG CAG +TC TCG CTC 

i 

A F V G.A L G G A A F L G T A I p 
5CG TTT GTT GGG GCA CTG QGd GGC GCA GCC TTC CTG GGC ACC GCC ATC TTC 

1 I 



V L I 
GTG CTC ATC TGA 

GAGGCTGCCGCTCACCTACCTGCACATCTGCCACAGCTGGCCCTGGGCCCACCCCACGAAGGGCCTTC 
TGGCCTG<^CCAGCTTCCAGAGGGACCCTGGGCCGTGTGCCAGCncCCAGACACTA 
TGGGGGTCCAGGAGGGGGATCCCTCTCCACAGGGGCAGCCCCAAGGGCTCGGTGCTATTl^T 
AGCCAGAAAAAAAAAAAAAAAGGGCGGCCGC ' , 
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Protein Family / Domain Matches, HMMer version 2 



PCT/US01/49060 



Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright <C) 1992-1998 Washington University, School of Medicine 

HMMER is freely distributed under the GNU General ■ Public License (GPL) . 



HMM file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pfam6 .4/Pfam 
/prod/ddm/wspace/orf anal/oa^script . 9015 . seq 



Query:. 4 6455 

Scores for sequence family classification (score includes all domains): 



Model 



Description 
Sugar (and other) transporter 



sugar_tr 

1 » . . 

Na_Galacto_symp Sodium:galactoside symporter family 
' 1, ... 
MCT . Monocarboxylate transporter 

• 1 ... 



Score 

-63 .4 
-121.2 
-208.2 



E- value N 
0.00016 
0.17 
0.32 



Parsed for domains: 
Model Domain 



seq-f seq-t 



MCT 1/1 60 473 

sugar_tr l/l 58 4 87 

Na_Galacto_symp l/l 212 505 



hmm-f hmm-t 

1 611 

1 488 

1 285 



score E-value 

U -'208.2 0.32 

[) -63.4 0.00016 

[] -121.2 0.17 



Alignments of top-scoring domains: 

MCT: domain 1 of 1, from 60 to 473: score -208.2, E = 0.32 

*->kpPDGGwGWvWfasFlingfvdGf iksfGvf f sellqeetlf nesk 
++V in + ++++ ++ ++ e++fn+++ 
46455 60 L I VAVLCY I NLLN YMDRFTVAG VLPD I EQFFNIGD 94 

sdvdtAwIgSimlavllf sGPlsSilvnrf GcRivmiaGgllagaGllla 
s ++I ++++ + ++P+ + 1 +r+ ++ m+ G ++ ++ 1 + 
4 6455 95 SS - -SGLIQTVFISSYMVLAPVFGYLGDRYNRKYLMCGGIAFWSLVTLGS 142 

sFst . .niwelyltfGvitGlGfgf if qPai . vilgqYFe KrRsl 

, sF ++++ w+1 It G ++G+G + ++ + ++ ++ F +++++ +s+ 

46455 143 S FI PgeHFWLLLLTRG- LVGVGEA- SYSTIApTLIADLFVadqrsRMLS I 190 

AtGiAvaGsGvGtwfppllqf lidnyGsDV?rgallilggillncvicGa 
+ GsG+G +++ ++ d G DW +al+++ g+ + v++ + 

46455 191 FYFAIPVGSGLGY- IAGSKVKDMAG - DWHWALRVTPGLGWAVLLLF 235 

lllRPlepsvpqdekdkeqetlkeakkkkendtettkeeteplkslpkas 
1 +R+ +p++ ++ 
46455 236 LWRE - - - - P PRGAVER : 248 



ilkledakaersvdsLlsskSvgerdksqlsekqksqasgrpsssatavq 



46455 



lvllrsrlekadlplkrvrvsrrrvlSkVSaeSgtdgersSgylNrkdvF 
+s+l 

46455 249 HSDL 

YtGsisNvaefkedpdkYrssslhgtrttvgriaesqstlrlddsresgdg 



252 



46455 
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dsssedlsektrgdggkkessskeiretikkllDfsvlk.nrtFll . ... • 

+ + + + ++ D+ + 1 +n++F+l++ + 
46455 253 - - PPLNPTSWVJADLRALArNPSFVLsslg 579 

.yaisnlfaslGf f vPlvf LvsYaikslg . . . ; IdekeAsfLls 

a++++ sl++ + P+ + L ' s++. • ilg+*+++ H- +++. ++++ js++ 
, 46455 280 f TAVAFVTGSIALWAPAFLLRSRV- - VLQetppclpgdsCSSSb-SLIFG 326 ' 

iiGvsnivGRpifGlvADkkgvrpTarhiyyif nlsllal . .GlttlacP 

i + v + + +G+ + .+++ r + ++ 11 . ++1++ ++ 
LITCLTGVLGVGLGVEISRRLRHSNPRADPLVCATGLLGSapFLFLSLAC 376 



46455 327 



latsfwgLv^cilFGfs.iGsygaLtfvvLVdlivgWlek fsnA 

+.S++ +++ i + G + +++a,+ + +L vi + ++++ + f+ ' 
46455 377 ARGSIVATYIF- XFIGETlLSMNWAIVADILLYWI- PTRrstaeaFQIV 424 

fGllllfeGvavLvGPPi. . . . aGlLvDakttgdYtvaFyf sGillllsg 
+ + 11 G + L+G + + + + + ++ + +• + + + + + +++ + 

46455 425 LSHLLGDAGSPYLIGLISdrlrRNWPPSF-'-LSEFRALQFSLMLCAFVGA 472 

i ...» - ' ■ 

• - • *■ 1<~* 
1 

46455 473 L 473 

sugar_tr: domain 1 of 1, from 58 to 487: score -63.4, E - 0.00016 

*->valvaalgGgflfGyDtgviggflalidf If rfgll tssgalaelvg 
al++a+ + + + +'+++ + + + f+ + +s+ 
46455 58 SALIVAVLC- YINLLNYMDRFTVAGVLPDIEQFFNIGDSS — 96 

ys tvl tgl ws i f f iGr 1 iGsl f aGklgdr f GRkks 1 1 ia 1 vl f viGal 1 
+gl+ ++f + ++ + ++G+lgdr+ Rk+ + + +++ + ■ +1+ 
46455 97 SGLIQTVF I SS YMVLiAPVFGYLGDRYNRKYLMCGG IAFW.S LVTLG 141 

sgaapgytTiGlwafyllivGRvlvGlgvGgasvlvPmYisEiAPkalRG 
s ++pg +f+ll++ R lvG g s ++p + +i + + R 

46455 142 SSFIPGE -HFWLLLLTRGLVGVGEASYSTIAPTLIADLFVADQRS 185 

algslyqlaitiGilvAaiiglglnktnndsalnswgWRiplglqlvpal 
+ + s++ +ai +G +++i g ++ + + +d +w R+ gl+ v 1 
46455 186 RMLSIFYFAIPVGSGLGYIAGSKVKDMAGDW- - -HWALRVTPGLGWAVL 232 

i 

■ lUigllflPESPRwLvekgkleeArevLaklrgvedvdqeiqeikaele 
11++++ P r g + + ++ + +++ 

46455 233 LLFLVVREPP- ------ _ - -RGAVERHSDLPPLNPTSW 260 

atvseekagkaswgelf rgrtrpkvrqrllmgvmlqaf qQltGiNaif YY 
+ + + 1 r+++ +1 + + + a + +tG ++ + 

46455 261 WA- DLRALARNPS FVX.SSLGFTAVAFVTG- - SLALW 293- 

sptifks . vGvsdsvasllvtiivgwNfvfTfvaLiflvD 

+p ++ ++ +++++ +ds +s ++ i +g+++ ++ + + + i 
46455 294 APAFLLRsrwlgetppCLPGDS-CSSSDSLIFGLITCLTGVLG- VGLGV 341 

rfGRR . .pllllGaagmaicflilgasigvallllnkpkdpss 

+ RR ++++++ +pl++ ++ + + fl+l+ 1++ ++ 
46455 342 EISRR1 rhsnpradPLVCATGLLGSAPFLFLS LACARGS 380 

kaagivaivf illfiaf FalgwGpipwvilsElFPtkv. . . .Rskalala 
iv++++fi+ + + + w+i++++ v +++Rs+a a+ 
46455 381 - IVATYIFIFIGE-TLLSMNWAIVADILLYVViptrRSTAEAFQ 422 

taanwlanf iigf If pyitgaigl . . . . alggyvf lvf agl . . .Ivlfil 
++ 1 + + + py+ g i+ +++++++++ f + i + + x 1+ + 
46455 423 I VLSHLLGDAGS PYLIGLISDrlrrNWPPSFLSEFRALqf sIiMLCAF 469 
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fvf f fvPETkGrtLEeieelf <-* 

++ '++ + + . G +x t . 

46455 470 \AjALGGAAFLK3 TAIFIEA 487 

Na_Galacto_symp: domain 1 o*£ 1, from 212 to 505 score -121.2, E 1 = .0.1-7* 

t *->qlG.yf f falV. I .LsaagwliwiCf f gtkEvySssdtreng 

+ G++++++ V+++L++++V+I1+++ +++ ++g E+ sd ++ + 
46455 212 MAGdWHWALRVtpgLGVVAVLLtFLVVre jJpRGAVERH- - SDLPPLN 256 

qkttsllqslkllakNdQ. . LliLclaalf yllainilgg , aqlYYvtYv 
+ + + + l++la+N++ L L++ a .+ '++ +1 ++a 1 + v . 

46455 257 PTSW-/-WADLRAUUWPSfvLSSU3FTAyAFVTG 304 

LG.dpelFS ylllynilvgligslLfPrLvkrf . .gkktv 

• LG++p +++++ +++++ +++1 + +1 g g+ L ++r++++ ++ 
464^5 305 I^eTPPCLPgdscsssdslIFGLITCLTGVI^^ 

..' FagcivlmvlgslliFfvagsslal . ilvlif lagilqqlvtllvWvlQV 
+ ++lg ++ F++ +++ a + +v++ ++ + + + +W++ 
46455 355 DPLVCATGLLG-SAPFLFLSLACARgSIVATYIFIFIGETLLSMNWAI-- 401 

' IMvsDtVDYGEwktGvRlEGlvySvf If vlKlGlAlsGalvGwiL . .gyi 

v+t)+ Y t +R+ + + -H+ 1 .1 1G A s 1+G+i + + 

46455 . 402 - - VADILIiYWIPT-RRSTAEAFQIVLSHL-LGDAGSPYLIGLISdrLRR 447 

I * 

GYvanasqststalgQlvf ilalFalPpallllaaf imlrf YkLtekkle 
» + ( ++ . s al+ f 1 1 a++ al +a + ++ f + + + + 
46455 448 NWPPSFL-SEFRALQ FS LMLtCAFVGALGGAAFLGTAI F I EADRRRAQ 4 93 

eIveeLekWrtrkrk<-* 
v L+ + + ' 
46455 494 LHVQGLL- - -HEAGS "V 505 
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jUSTAL W (1.74) multiple sequence alignment 



>h46455FL 
'2825 



>h4 6455FL 
>2 825 



>h4 64 55FL 
»2825 



»h46455FL 
2825 



h46455FL 
2825 



h4 64 55FL 
2825 



h46455FL 
2825 



MAGSDTAPFLSQADDPDDGPVPGTPGLPGSTGNPKSEEPEVPDQEGLORITGLSP r 
MVRNKVA P VEDQAN I QRNF E P P - - p.- - PYTT- - P - TDS PEDK I RSNSTATTASQPEFOGC 
■ ... • . j * • 

tyS^VLl VA VLCY I N LLN Yl^DR FT VAGVL PDI EQ FFN I GDS SSt3h I QT VF I S S YMVLA pupr 

WTIVWAI LFI INLLNYMDRYTIAGVLNDVQTY YNI SDAWAGLI QTTFMVFF 1 1 Fq P Trr 
': ::*.*:* **** *f ***.*.****.*.. . * 

YLfGDRYNRKfYLMCGGI AFWSLVTLGSSFIlPGEHFWLLLLTRGLVGVGEjAS YSTIAPTI T A 
' FLGDRYNRKWI FWGIAI WVSAVFASTFI PSNQFWLFLLFRGI VGIGEASYA I I S'PTVT A 
:********::: *** ; * ...:.*:***..;:***:** **;**;♦****. 

. • . • r . • . 

DLFVApQRSRJiLSI FYFAI PVGSGLGYI^SIWK^MAGDWHWALRVTPGLGVVAVLLLFL 

DMFTGVLRSRMLMVFYFAIPFGCGLGFVVGSAVASWTGHWQWGVRVTGVLGIVCLLLITV 
***** ;**,*♦*** . * m *** . . a ** * .-:*.*:*.;*** **.* .**,.. 

.... .... 

VVfREPPRGAVERHS-DLPPIJtfP-TSWWADL^ 
FVREPERGKAEREKGEIAASTEATSYLDDMKDLLSNATYVT^ 

.*'*** **.**.. : : . . . ' ** : * : : * * . : : * ****".**..* s- -*. ** ** 

PAFLLRSRVVI>GETPPCLPGDSCSSSD^IFGLITCLTGVIiGVGIX3MEIS - - -RR- ---L 
P I T 1 QYADS AR - RNGT I TE - DQKAN I N - LVFGALTC VGGVLGVA I GTLVSNMWS RG VG P F 
* : : - - - - *. : *:** : ** : ***** -: *^ . * * 

RHSNP-RAD^VCATGLLGSAPFLFLSLACJ^ 

KH I QTVRADALVCA I GAA ICIPTLI LA I QNI ESNMNFAWGMLF I CI VASS FNWATNVDLL 

::::** .*.*** *.* 



* - * * * ***** 



* * - * . 



h46455FL 
2825 



h46455FL 
2825 



LYWJI PTRRSTAEAFQpC VLSHLLGDAGS PYLI GLIJSDRI,RRNWP PS FLSEFRALQpSLiMIj 

LSWVPQRRSSASSWQILISHMFGDASGPYILGLISDAIRGNED-TAQAHYKSLVTSFWL 

* **.* ***.* . : ** ; t **:.:*** . .**: :*** * * .* * . ; . . . * * . * 

CAFVGAXiGGAAFLGTAI F ^EADRRRAQLHVQGL.LHEAGS TDDR I VVPQRGRSTRVP VAS V 
CVGTLVLiS VILFG I S AI TWKDKARFNE I MLAQANKDNTS SG - -TLPIEDRNTEDETGSE 

* . *. * . ** - *. * . . .... ■ * * * 



M6455FL 
2825 



VQHM 
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Input file 54414; Output File 54414. tra 

Sequence length 4632 ' ' ■ ■ 

CACCXTGTCCC^CCACGCGTCCGCCCACGC^ 

CCCCTCCCTGTCCCCTTCCCC^ 



TATTTTCTTTCTTTCTC ^ ^ 
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" P R L Q D Y Y ' V V I L C 'P. T E M 'd' V ' o 
CAT CCT AGG CTC CAG GAT TAT TAT GTG GTG ATT 'TTG TGT CCT ACT GAA ATG GAT GTA CAG 

V R "« V L 0 I P M W S <? R V ' r Y L Q G S 
.GTT CGA AGG GTA CTG CAG ATT CCA ATG TGG TCC CAA CGA GTT ATC TAC CTT CAA GGT TCA 

'* L K D Q D U L R A. K M D D i A E A C F T ,o, 
GCC CTT AAA GAT CAA GAC CTA TTG AGA GCA AAG ATG GAT GAC GCT GAG GCC TGT TTt' ATT 1149 

L S S R C E V D 'r T S S D H Q T I* L R A aHi 
CTC AGT AGC CGT TGT GAA GTG GAT AGG ACA TCA TCT GAT CAC CAA ACA ATT TTG AGA GCA 1209 

WAV K I D > A. P N C PLY V Q j' l K P E 
TGG GCT GTG AAA GAT TTT GCT CCA AAT TGT CCT TTG TAT GTC CAG ATA TTA AAG CCT GAA 
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1269 



N K F H I K F A D H V V C E E E F K Y A 
AAT AAA TTT CAC ATC AAA TTT GCT GAT CAT GTT GTT TGT GAA GAA GAG TTT AAA TAC GCC 1329 
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0 H N y. y E D A K A .Y G F K . N K L I X V 
CAA CAT AAC TAC TAT GAG GAT GCA AAA GCC TAT .GGA TXp AAA A^T AAA CTA ATT ATA GTT 2i69 

_ E ' T A ° N G L Y N .. F I V P L . R A Y y 743 



GCA GCT GAA ACA GCT GGA AAT GGA TTA TAT AAC TTT ATT GTT CCT CTC AGG GCA TAT TAT 2229 

R P K K E L N P I V L L L D N P i, D D L 763 

AGA CCA AAG AAA GAA CTT AAT CCC ATA GTA .CTG CTA TTG GAT AAC CCC CTA GAT GAC TTA 2289 

L R C 9 V T F A A N M V V V D K EST M 781 

CTC AGG TGT GGA GTG ACT TTT ' GCT GCT AAT ATG GTG CTT GTG GA^ AAA GAG AGC ACC ATG 2349 



A E E V Y M A D A K T I V N 
3CC GAG GAA GAC TAC ATG GCA qAT GCC AAA ACC ATT GTG AAC 

U F S S L S I I T E L T ' H P 



M 



M Q F R A K D C Y S L A L S K L E K K 
&TG CAA TTC AGA GCC AAA GAC TGT TAC TCT CTT GCT CTT TCA AAA CTG GAA AAG AA/ 

R E R G S N L A F M F R L P F A A G R 
ZGG GAG AGA GGC TCT AAC TTG GCC TTT ATG TTT CGA CTG CCT TTT GCT GCT GGG AGC 

F S I S M L D T L L Y Q S F V K D Y M 

rrr agc atc agt atg ttg gac act ctg ctg tat cag tca ttt gtg aag gat tat atg 
S I t r l l l g l d t t p g s g f l c 

rCT ATC ACG AGA CTT CTG TTG GGA CTG GAC ACT ACA CCA GGA TCT GGG TTT CTT TGT 
M K I TA D D L W X R T Y A R L Y n xc 



E D 'T K D S K E Q G H H R S K H R N 
3AA GAC ACC AAA GAC TCC AAA GAA CAA GGG CAC CAC CGC AGC AAC CAC CGC AAC 

s S D Q S D H P h L R R K S M Q W A 



N L Y R R S E 



S T V G Y DEM N D H Q S T 
rCT ACA GTG GGA TAT GAT GAA ATG AAT GAT CAT CAA AGT ACC 

N P S P D T R I E L N D V V 
\AC CCA TCT CCA GAT ACC AGA ATA GAG CTG AAT GAT GTT GT/> 
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L 1 R P D P L A Y L .P N S E P s R R m 
TTA ATT CCA CCA GAT CCA CTG GCC TAC CTT CCA ,AAC AG.T GA G ' CCC ACT CGA AGA AAC AGC Hoi 

I C N 1 V T G Q D S R E E T Q L • ' 
ATC TGC AA.T GTC ACT GGT CAA GAT TCT CGG GAG GAA ACT CAA C?TT TGA 335* 

TAAAAATAAAATGAGAAACTTTTTTCCTACAAAGACCTTGCTTGAAACCACA^ 

GATGGAAATATATGTAATTCTCTCATATTTAAAAACGTAATCTCTTCTCTTAGAAGTATA 

GTACTACTTACTGGTACTCTCCCTATTAATATTTGAAGGACCT^ 

AAAATTTAAATCTGACATTTAATTGTTTTATAATAATCC^AAACTCTAlXjAAAGC 

tgaagttgacaaaatctaactatatttggtgcaItcacaatggacacagaatgct^ 

GTCATATT^TATTCTTTAAACTTACTGTTTTACAAA^ 
ACCAACAAACTTGTGTGGCTGACTTTTGTGTAAGAATCATAGTTTG^ 

tttttttCtgccttacgatataaaaatatttatcttagaat^ 

CTAAAACATAAATGAAAAGAAACACTAGGTTCCTGCACTTTTTGGTAACTTTAl^ 

AAAG<^TACTATAAAGCAAATATCTATTATTCTCCrAAACGAATGCCTAGC^ 

TGACTTAAATTTAATTCAAGGATTGAAAAATTAACIXSGATATCT^ 

AACTl^ACCATCTTTCCTATTCATATATCTATATAGTAAATTTC 

TTTATACCTTTTATGGACTCTGAAGACACTTCAAAACATTAAAAGTCCr^ 

TGANGATGTATTAAATTTTGACTT 
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WO 02/055701 

Protein Family / Domain Matches, HMMer version 2 



PCT/US01/49060 



Searching for complete domains in PFAM 

hmmpfam - search a single" se^ against HMM database ' ' 

HMMER • 2 .1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine, p 
HMMER is freely distributed under the iQNU General 'Public License (GPL) 



HMM file: 
Sequence file : 



/prod/ddm/seqanal /PFAM/pf am6 . 4 /Pf am 
/prod/ddm/wspace/orf anal/oa- script . 3743 . seq 



Query: Fbh54 414 



Scores for sequence family plassif icfation (score includes all, domains) : ' 1 
Model Description Score E- value N 



ion^trans Ion transport protein 
1 ' 



.62.4 9.9e-15 



Parsed for domains: 

Model ' Domarri seq-f seq-t hmm-f.. hmm- t 



ion trans 



1/1 



104 



277 



223 [] 



score E-value 



62.4 9 r 9e-15 



Alignments of top-scoring domains: 

ion_trans: domain 1 of 1, frq>n> 104 to 277: score 62.4, E = 9. 9e-15 

■ *->ilf ildllfvllflleivlkf iayglkstsniaakylksif nild^l 
t ++++ ++ + ++ +i+l + +y ++ + + + + + + i + il ++ 

Fbh54414 104 ■' LWGLQVSVALISLFETILLGYLSYKGN IWEQILRXPFILEII 145 

ailplllllvlf lsgteqvakkrlrerf slelsqwyyrilrf lrlLrllR 
++P+++++++ ++ 1 + ++ 

Fbh54414 146 NAVPFI ISIFWPSLRN- LFVPVFLNCW- 171 

ILrllrllrrletlfefelgtlaWsltislgralksilrfllllillllgf 
1 + + +++ + + i + ++ r++ ++++ +1+1+ 1++ 
Fbh54414 172 - - - LAKHALENM INDL HRAI QRTQS AMFNQ VL I LI STLLCL 209 

svigyl 1 f kgyedl senevdgnse f s sy f da f yf 1 f vt 1 1 1 vGf Gdl vpv 
++ + +++ e + ++ ++fd++yf ++vt++tvGfGd++p+ 

Fbh54414 210 I FTC I CG I QHLER - IGKRLNLFDSLYFCI VTFSTVGFGDVTPE 251 

. wlgiif fvlf f iiygllllnlliavi<-* 
+w++++f+ + +i+v+l++l + + + 
Fbh54414 252 t WS S KL FV - VAM 1 CVALWLP I Q FEQL i 277 
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CUUSTAL. W U.74) multiple sequence alignment 



544 14 .prot 
AF0897 30 



54414.prot 
AF08973O 



S4414.prot 
*VF089730 



54414.prpt 
AF069730 



54 4 14. prot 
AF089730 



54414 .prot 
AF089730 



t MVDLESEVPPLPPR-t-YRFRDL — ^LGDBGWQN ' 

HARAKLPf^PSECKAGPGDTPAGSAAP 

* • * • - * * . 

T/M " 



DD WQVEFYt^E^TrVJEPJJ^rFIK^nSSLRU LFNFSLXLLSCLLYI IRVLL ENP 



SQGN ' . EWSHIFWVNRSLF ^WGLQVSVAL I SLFET I LUTYLSY ( 

DQGIGCWCXrnC^YTFNGSSSEFW 



T7M3 



KGNrWBQ CLRIPFILEIINAVFFIIS IFWPSLF ^JLFVPVFL^O/IJ e tKHAI#EI^ 

kgktweq r^W5E^^JSMI^^rt,PFII^^ ^ 



I QR tTQSAMFNC^ /LIL t STLLCLIFTCICG3 3HLERIGKRLN ^FDSLYFC IVTFSTVGPGD 
IU ^SAMFNC^ILFCTI^^ 3HLERftGGNtiN ^ltsfyfcivtfstvgfgd 

« *»»•*****#***.. ***•*.** ******** * * * * . *.************** 

~\TTpfawSSl£F\i^^ 



54414 .prot SUC IDI^IDFLNEFYAHPRLQDYYVVT LC PTEMDVQVRRVLQI FMWSQRVI YLQGSALKD 

AF089730 SIJCXDLJ>IDFXWEIFYAHPRLQDYy\A/IL^ 



544 14 . prot QDUJlAKMIXiAEACFII^SRCSVDRTSS^ 

AT089730 QDLMRAKMDNGEACF II*S SRNKl/DRTAADHQTILRAS^VKDFAPt^C PLYVQI LKPFUKFH 

54414 .prot IKFAIHVUCEEEFlCyAMLALNC ICPATSTLITLLVHTSRGQH3QQSPBQWQKMYG3<CS^24 

Mt>89730 VKF ADITVATCEEDCKY AMLALI^C I CPATSTL I TLLVHTSRGQBGQE1S PE(^JQBM)tGSlCSQ^ 

54414 .prot E>TmrVl^XSTFFAEYEG3<LSFTYASFHAHKKPG^ 

\FO89730 E\rmnU*3}SKFFREYBGI^FTY7^ 



54414 .prot DICFYXMITKEENSA — FKNQDQQRKSNVS — SFVHGPSRLPVHSIlASM3W7UIXi3DT 

AFO 89730 PTCFYTN 1 IIU^KNSAF IFKQE^KQKRRGIJV3QALYBGPSRL PVHS I IASM- - VAMDLCMT 



54414 . prot SCRSASGPT --I^PTEGSKEIRRPSIAPVLEVAOT 

AF089730 DOlPSQOTSG<XX3GXLTLPTEN^ 



*•***.**.*• 



54414. prot DEEMSSNI^AKGYPPYSPYIGSSPTFCHIXHEKVPFC^^ 

AFO B 9 7 3 0 DDBGUSWEYVKGYP PNS FYTGSS POTX>DLLPVKAPFCCLRIXKGCKHNSY 



► . * * * « 



54414 . pro t NKLI IVAAETA/SbJGLYNF 3TVPLRR.YYRPKKEIJWP rV/IJlXI^ P 

AF089730 NKLirVSAETAGNGI,Yl^^ 

i 

54414. prot UDDLI^CGVTFAANM^^ I IT 

AFO 897 30 EGSVIX^J^1^I>QCGI I YADt^VVVDKEST^ 
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EX.THPANMRFMQFKAKIXrYSLAL^C^ ISMLDTLL 
^TOPSNMRFMQFTU^KDS^ 



YQSFVKDYM I S ITRLUjG1X)TTPGSGFT>CSMK ITADDLWI RTYARL YQKJjCSSTGDVP IG 
YQSFVKDYM I TITRLLLGLDTT PGSGYIXIAMKVTEDDLtyl RTY/GRLFQKLCSSSAEI PIG 



I YKTESQKLTTS ESRK IASQSQI S I SVEEWEDTKDSK - EQGHHRSNHRNSTSSD 

I YRT BCH - VFS S E PHDLRAQSQ I SVI^MEDC EDTREAKG PW3TRAASGGGSTHGRHGG SAD 
***** . .** . . .*«***. .*'. ***...* * • * . * . • 

• - - ' i ■ - : 

QSDH PLLJ^RKSI4QWARRLSPJ<G?KHSGK TAEK ITQQRLNL YRRS EHQFXASLVKNRM 

F\/CTPLlJRRK£l^WARKLS^SSKQA^ 



54414. prot KHLGLSTVGY- - ' DEMND- HQSTLS YIL TtJPS PDTR I ELNDV 

AF0d9730 KHIjGLPTTGYEDVANI/FA^DV^^ 



£4414. prot VYL.IRPDPLAYLPNSEPSRRNS ICNVTG- - -QDSREETQL 

AF089730 VYLIRSDPIJ^VTSSSQSRKSSCSNKLSSCNPE^ 

*•*« ****«. . ^ * **. * * ^ ..*„*■*** 



54414 .prot 
... AF089730 



54414. prqt 
AF089730 



54414 .prot 
AF089730 



54414 .prot 
AF089730 
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Input fj]V» Fbh53763poc .s«*q; Output File FlihS 37fi 3 P ai . i r* 
Sequence length 28.4 7 ' 



» i 

CCACGCGTCCGGCCCTG+GCTTCGGATGGCGGCGGGAGGTTGATCGCGAGTCGTCCTCAA^ 
CTGATTTGGGGGAGAAACAAAATCTGCAGATGGAATCCGAGCAGGGCGACTTCACCTTCAAGTt^ 

C*TCCGGCCAGTCTCCACTCCATT<rACGGCCAGGCGATCT^CCCGCTCCCGGAGGGGTCGGGCAGTGCCGGCTGGACCCG 

CCCCGAGCTCCATGGTTTGCCCAACCGTGCIjCGATGGTGACTGTGGGCGCGGAGGTTGGCGACTGGCAAATC 

CACAGAATGAAGGCGGGGAGCGCGGCCGGCWKrCGGCGGGGGerTTC^ 

A^ACCTGAATCCGGAAAACC-CCAACAAGTAGTTTCrrCG^GGAG 

CGCTGCCCAGAGAACC'TCGTCCACTCGGAAACCAAAGCAGAACCACTTTTCTCTCGGTCTCGTTAAGTCATGTCTGAGT 
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°" P A L T Y V. E a V . C V V w. F T F e K 

rrr. aci 

, R 1 ' V F S - P N K t, E F -I K N L. 



CAT CCT GO" TTG ACG TAT GTA GAA GGA GTG • TGT GTG G.TG 'TGG TTT- ACT TTT GAA TTT TTA 



29* 
894 



GTC CGT ATT CTT TTT TCA CCC AAT AAA CTT GAA TTC ATC AAA W CTC TTG AAT ATC ATT JJJ 

GAC TTT GTC G^C ATC CTA CCT TTC TAC TTA GAG GTG GGA CTC ACT GGG. CTG TCA TCC AAA 10lJ 

A A K D V L G i F L R . V V R p" V 'ft T r t> 
OCT CCT AAA GAT GIG CTT GGC TTC CTC AGG GTG GTA 'AGG TTT GTG AGG ATC CTG AGA ATT lo" 

F K L T R H P V G L R V L G a T 
TTC AAG CTC ACC CGC CAT TTT GTA GGT. CTG AGG GTG CTT GGA CAT ACT CTT C GA GCT ACT uJj 

T f E f . L . L L I I F L A L G V L I v> \ ' 
ACT AAT, GAA TTT TTG CTG CTG ATA ATT TTC CTG GCT CTA GGA GTT TTG ATA TTT GCT ACC All 

M I Y Y A E R V G A Q P NOP s a c 
ATG ATC TAG TAT GCC GAG AGA GTG GGA GCT CAA CCT AAC GAC CCT TCA GCT AGT GAG CAC 12s! 

T Q F K N 1 PIG F W W A V V T M *P ~ 
ACA CAG TTC AAA AAC ATT CCC ATT GGG TTC TGG TGG GCT GTA GTG, ACC ATG ACT ACC CTG 1314 

GGT TAT GGG GM ATG TAC CCC CAA ACA TGG TCA gSc ATG CTG GTG GGA GCC CTG TGT GCT 13t1 

L A G V L T I A M P V P V I VMM „ ^ 
CTG GCT GGA GTG CTG ACA ATA GCC ATG CCA GTG CCT GTC ATT GTC AAT AAT TTT GGA A^G lill 

Y Y S L A M A K Q K L P R K U k v u ^ 

TAC TAC TCC TTG GCA ATG GCA AAG CAG AAA CTT CCA AGG AAA AGA AAG AAG CAC ATC CCT 14s! ' 

P A P Q A S S P T F C K T E L N M 
CCT GCT CCT CAG GCA AGC TCA CCT ACT TTT TGC AAG ACA GAA TTA AAT ATG GCC TGC AAT llU 

AGT ACA CAG AGT GAC ACA TGT CTG GGC aL GAC AAT CGA CTT CTG GAA CAT a5!c AGA TCA ill! 

V L S G D D S T G s E P P L S p p E n T „o 
GTG TTA TCA GGT GAC GAC AGT ACA GGA AGT GAG CCG CCA CTA TCA CCC CCA GAA AGG CTC 1674 

*~ 1 R R S S T R D K N R R G E T C F L T S7fl 
CCC ATC AGA CGC TCT AGT ACC AGA GAC AAA AAC AGA AGA GGG GAA ACA TGT TTC CTA CTG llll 

T T G D Y T C A S D G G I R K G Y p v o «,__ 
ACG ACA GGT GAT TAC ACG TGT GCT TCT GAT GGA GGG ATC AGG AAA GGA TAT GAA AAA TCC 1794 

CGA AGC TTA aL A^C ATA g£ G GGC TTG GCA GGC' AaV GCT CTG AGG CTC TCT CCA GTA ACA 1854 

S P " Y N S P C P L R R S R S P I p 'o T i fi ,o 
TCA CCC TAC AAC TCT CCT TGT CCT CTG AGG CGC TCT CGA TCT CCC ATC CCA TCT ATC TTG 1914 
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TAA 63 9 
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*<™~C;VACCAA^^^ 

TAGCTAATGAATCTTACCCAGGCTTTAAAGATTGTCT^ 
TTTTGC^TAAAATGACAGATAGTGTCAGATATTGACCAG^ 

AAAACAGTGTGCTTCCAAATGCCAAeCACTTCATTGGAACri^ATTT^ ' 
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Searching for complete domains in PFAM . , 

hmmpfam - search a single seq against HMM database 
HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University 'school of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) 



HMM file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pfam6 . 4/Pfam 
/prod/ddm/wspace/or f anal /oa - script .4688. seq 



Query: Fbh53763 : 

i • 

Scores for sequence family classification (score includes all domains): 
Model Description Score E-value N 



K_tetra K+ channel tetramerisation domain 156.7 
1 

• ion^trans. Ion transport protein . 116.9 

l\ 

oxidored__q3 NADH- ubiquinone /pi as toqui none oxidoreduct -81.7 
1 



4e-43 
3.9e-31 
5.6 



Parsed for domains : 

Model Domain seq-f seq-t 



hmm- f hmm- 1 



score E-value 



K_tetra 1/1 8 156 

oxidored_q3 1/1 317 467 

ion trans 1/1 281 472 



111 (3 
177 [) 
223 [J 



156 . 7 
-81.7 



4e-43 
5.6 



116.9 3.9e-31 



Alignments of top-scoring domains: 

K_tetra: domain 1 of 1, from 8 to 156: score 156.7, E = 4e-43 

*->ErvrLNVGGkrFeTsksTLtrfkpdTlLgrllktdsd. . 

Erv+LNVGG+r+eT++sTL ++ p T+L 1++ S+++++ ++ + 
Fbh53 763 8 ERVILNVGGTRHETYRSTLKTL- PGTRLALLAS - -SEppgdcl t tag 51 

vhearlrlcd 

++ ++++++ +++++ ++ ++++++ +++ ++ + ++++++ d 

Fbh53763 52 dklqpsppplsppprapplspgpggcf eggagncssrggrA --SD 94 

f yddetgEyFFDRsPkhFet ILnf YRtGdGkLhrp . evcldsf leEleFy 
+ E+FFDR+P++F ++Ln+YRt GkLh+p +vc f+eEl+F+ 
Fbh53763 95 HPGGGR-EFFFDRHPGVFAYVLNYYRT- -GKLHCPaDVCGPLFEEELAFW 141 



Fbh53763 



gldelaiesCcedeY<- * 
g+de ++e+Cc+++Y 
142 GIDETDVEPCCWMTY 156 



oxidored_q3 : domain 1 of 1, from 317 to 467: score -81.7, E = 5.6 

* - >mtyivliLsillvlGf lgVaskpsPiYgaLgLivaggvGCGlvlslG 
+ +v iL + 1 +G+ g++sk + +++ +v ++ + + ++ 1 
Fbh53763 317 I IDFVAILPFYLEVGLSGLSSKAAKDVLGFLRWRFVRI -LRIFKLT 362 



Fbh53 763 



gsFvalvlFLIYLGGMlWFgYtvalateeyPEaWgsnkwwtigdgval 
Fv+1 ++ g t t e+ + ++i ++1 
363 RHFVGLRVL - -GHTLRASTNEF L LLI IFL 3 89 



vl gl 1 i e vl Lvg 1 vl gwt e wi wa 1 tgl Gdwvi Ydvegsg 

lg+li++ ++++ ++ + ++++++ e + +++ G w + v+ + 
Fbh53763 390 ALGVLIFATMIYYAErvgaqpndpSASEHTQFKNIP- IGFW- -WAWTM- 435 

1 i redl sGvaaLYs cgvwmf evaGwvLLval f wiel tR< - * 
++ G +Y +w + G L al +V++ + + + 
Fbh53763 436 TTLGYGDMYPQ-TWSGMLVG- -ALCALAGVLTIAM 467 
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ion_trans: domain 1 of 1, from 2 81 to 472: score 116.9, E .= 3.9e-31 

*->ilfildllfvllflleivlkfiayglkstsniaakylksifnildll 



Fbh53 763 



Fbh53763 



Fbh53763 



Fbh53 763 



Fbh53763 



1++++ ++v++f ++e++++++ + ++k 
28 1 ALTYVEGVCVVWFTFEFLVRIVFS PNK 



++k+ ni+d+ 
-LEFIKNLLNIIDFV 321 



ailplllllvlflsgteqvakkrlrer . f . slelsq.wyyriirf lrlLr 
• ailp+ + l ++1 ^"+++s + + ,+. +fl r + + r 

322 AILPFYLEVGL — - SgLsSKAAKDvL- « GFLRWR 351 

llRlLrllrllrrletlfefelgtlaWslqslgralksilrflllllill 
++R + lr++ +++ , + + + 1+ l g++ i ++ ++ +111+++1 

352 FVR- -.- ILRIFKLTR- ■? -HFVG LRVTiGHTLRASTNEFLLLIIFL 389 

• i ■ ' . 

igfsvigyllfkgyedlse. . . .nevdgnsef ssyf daf yf If vtl t tvG 
+ +i++ + ++ e+ +++ + +++ + f +++ +f ++++vt+tt+G 
390 ALGVLIFAJMIYYAERVGAq^ndPSASEHTQFKNIPIGFWWAVVTMTTLG 439 

fGdlvpv.wlgiiffvlffiivgllllriXliavi<-* ' 
+Gd++p ++++++g+l +++ ++++vi 
44 0 YGDMYPQtWSGMLVG-ALCALAGVLTIAMPVPVI 472 



// . 

Searching for complete domains in SMART 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) 

HMM file: 
Sequence file: 



/ddm/ robison/smart/smart /smart .all .hmms 
/prod/ddm/wspace/orfanal/oa-script .4688 . seq 



Query: Fbh53763 

Scores for sequence family classification (score includes all domains) : 
Model Description Score 



BTB_4 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t 



BTB 4 



1/1 



8 



159 



. 1 



114 [] 



72.7 

score E-value 
72.7 7..8e-18 



E-value N 
7.6e-18 1 



Alignments of top- scoring domains: 

BTB _ 4: domain 1 of 1, from 8 to 159: score 72.7, E = 7 . 8e-18 

* - >cDvtlwggdlggdnaegkkf hasqHkavLaacrrdSpyFkalf es . 
++v+l+vgg ++ ++ - +++L + +++ +al+ s+ 
Fbh53763 8 ERVILNVGG- - TRHET- -YRSTLKTL PGTRLALLASs 40 



Fbh53 763 



Fbh53 763 



+ + +++ ++ +++ ++ ++++ +++ + + ++ + +++ + + + + + ++ ++++ + 
41 eppgdclttagdklqpsppplsppprapplspgpggcfeggagncssrgg 90 

......... .ieldDeallievspeaFralLnflYt.kldlpeedvenve 

+ ++++++++++++D ++p +F +Ln+++t+kl++p + ++ 

91 rasdhpgggrEFFFD- - - - - -RHPGVFAYVLNYYRTgKLHCPAD- - VCGP 13 2 



elLelAdf IdSYGqip . IvelCeef llknl< - * 
+ e+ f+ G+ ++ ve C+++ +++ 
Fbh53763 133 LFEEELAFW- - -GIDEtDVEPCCWMTYRQH 159 



// 
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LUS'TAL W (1.74) 'multiple sequence alignment 



bh53763pa t 
atCIKE 



bh53763pat 
atCIKE 



MGKIENNERVIL^^/GGTRHETYRSTLKTLPGTRLALliASSEPPGDCI J TTAGDKLQPSPPP 
MGKIENNERVILNVGGTRHETYRSTLiKTLPGTRIALLASSEPQGDCLTAAGDKLQPLPPP 
******** * * ****** * * ************************ *** * * .******* * * * 

» * » • 

LSPPPRAPPLSPGPGGCFEGGAGNCSSRGGRASDHPGGGREFFFDRHPGVFAYVlijYYRT 
LSPPPRPPPLSPVPSGCFEGGAGNCSSHGGNGSDHPGGGREFFFDRHPGVFAYVLNYYRT 

******* ***** * ************ : ********.****** ***** * A ******* ** 



bh53763pat 
atCIKE 



bh537€3pat 
atCIKE 



bh537 63pat 
atCIKE 



bh53763pat 
atCIKE 



bh53763pat 
atCIKE 



bh53763pat 
atCIKE 



G KLHC PADVCG PLFEEELAFWG I DETD VE PCCWMTYRQHRDAEEALD I FET PDL I GGD PG 
GKJ^HCPADVCGPLFEEELAFWGIDETDVEPCCWMTYRQHRDA^EAIJDIFETPDLIGGDPG 
************************************************************ 

•DDEDLAAKRI^IEDAAGLGGPDGKSGRWRRLQPR^ 

nnPim .rt^KP IX^J KDAAGLGGPDGKSGRWRKIjQPRMWAIjFEDP YSSRAAF gFI AFASLFF I L 
***** **********************: ****************************** 



VSITTFcQeTHEAFN I VKNKTE PV INGTSWLQYEIErTDPALTYVE^VCWWFTFEFLVR 
VSITTFCLjg THEAFNIVK^KTEPVINGTSAVIiQYEIETDP 

****** *********************** . ****************************** 

"Trvy 



IVFbPNKLEFI KltfLLNI IDFVAILPFYLEVGLfeGLSSKATVKDVLC^LRVVRFVRI L#RI FK 
iypfe PNKLEFI K ^LNIIPFVAILPFYLEVGIiB GLS SKAAKDVL GpLRVVRFVR I IvRI FK 
************************** **************************** ****** 

^ x ^ -r T -V 



L»TRH FV G i4RVXjGHT LRA S TNEjF LiL L» 1 1 FLAIX5VX.I FATNJI YY^JERVGAQPNDPSAS EHTQ 

[yyKei 



Li TRHFVGI RVT^HTLRASTN EtFLLLI I FIALGVL, I FATMI YYj AERVGAQPNDPSASEHTQ 
****** * * **************************************************** 



FKNIFflGFWWAWTMTTLGYG DMYPQTWSC ML.VGAL.CALAGVL.TIAMPVPVI V ^FGMYY 
FKNI p(l GFWWAVVTMTTIjG Y^>MY PQTW S QMLVGALCALAGVLT I AMPVP V I Vy flNFGMYY 
************************************************************ 



bh53763pat 
atCIKE 



SIiAMAKQKLPRKRKKHI PpAiPQASSPTFCKTEIiNMACN STQSDTCI/SKDNRIjIiEHNRSVlt 
SLAMAKQKLPRKRKKHI PPAPLAS S PTFCKTEUSIMACNSTQSDTCLiGKENRIjLEHNRS VX» 
********************* *********;*****************:*********** 



*bh53763pat 
atCIKE 



SGDDSTGS EPPLS PPERIjPI RRS STRDKNRRGETCFLIjTTGDYTCASDGGI RKG YEKS RS 
SGDDSTGSEPPLSPPER1.PIRRSSTRDKNRRGETCFL1jTTGDYTCASDGGIRKGYEKSRS 
************************************************************ 



'bh53763pat 
•atCIKE 



I^N I AGLAGNALRLSPVTS P YNS PC PLRRSRS P I PS I L 
LNNIAGLAGNAIiRIiS PVTS PYNS PC PLRRSRS PIPSIL 
************************************** 
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; I t , r 

Input file Fbh6707 6FL.seq; Output File Fbh{S707 6FL . tra* 
Sequence length' 6582 

CCACGCGTCCGGCCACGCGTCCGCCCACGCGTCCGAGAAGGCTTAGGTGGGCAGGCAGGACGAGAGAAAGACTGAGAGG 
AGGGAAAGCCGCGTAGGTGGGAGTACAGCGGCGCGAGGGTCGAGGGGGAACCCTCGTCGGTOCAGATGAGGAGGGTGGG. 

CTTTCAGAACTAGTCCCCCCTCGCACC<*CGCCCCGCCCCTCCCGCGCTGGpGTCTTCACGGTC 

\ ' * i 

GGCTCCACCACGCCCGGAAGAGGGAGTCTGGCCGTCGGCTGGCTCA 

i 

GCCCGAGTGCGGGACCAGAGCGCGGGGCGGCGCGGCAGCGGCGGGCCGAGGAGGGGCTGCGAGCGAAACGGCGCGGCGC 

i ' 1 ' ' ' . 

GGCACGGCGGACGAGTTAGGGGtGGGGCGAGGGAGGCTGTGGCTCCCGACAGAGACAGGGGAGTAGTGTCGGGCTGAGG 

' M F R R S L N 7 

CGAGACAGCCCGGTAGAGCCCAGCTCAGCGCCCGGCAGCCTTCGACGCQ ATG TTC CGC CGG AGC TTG AAT 21 

R F C A G E E K R V G ' , T R T V F V G N H 27 

CGT TTT TGT GCT GGA GAA GAG AAA CGA GTT GGC AC A CGC ACA GTG TTT GTT GGC AAT CAT 81 

P V S E T E A Y* I A Q R F C D , N R I V S 47 

CCA GTT TCG GAA AC,A GAA GCT TAG ATT GCA CAA AG A TTT TGT GAT AAT AG A AT A GTC TCA 141 

S K Y T L W N F L P , K N L F E ' Q F R R I 67 

TCT AAG TAT ACA CTT TGG AAT TTT CTC CCA AAG AAT CTG TTT GAA CAG TTT AGA AGA ATT 201 

A n F Y F b I I F L V Q V T V D T P T S 87 

GCA AAT TTT TAT TTT CTC ATA ATC TTC CTT GTA CAG GTC ACA GTA GAC ACA CCA ACT AGC 261 

p v T S G U P L» F F V. I* T VTA I *K Q G 107 

CCA GTT ACC AGT GGA CTT CCA CTT TTC TTT GTT ATA ACT GTT ACA GCC ATC AAG CAG GGA 3 21 

Y E D W L R H R A D N E V N K S T V Y I 127 

TAT GAG GAT TGG CTG AGA CAC AGA GCT GAC AAT GAA GTC AAC AAA AGC ACT GTT TAC ATT 3 81 

I E N A K R V R K E S E K I K V G D V V 147 

ATT GAA AAT GCA AAG CGA GTG AGA AAA GAA AGT GAA AAA ATC ,AAG GTT GGT GAT GTA GTA 441 

E V Q A D E T F P C D L I L h S S C T T 167 

GAA GTA CAG GCA GAT GAA ACC TTT CCC TGT GAT CTT ATT -COT CTA TCA TCT TGC ACC ACT 501 

D G* T C Y V T T A S ti D G E S N C K T H 187 

GAT GGA ACC TGT TAT GTC ACT ACA GCC AGT CTT GAT GGG GAA TCC AAT TGC AAG ACA CAT 561 

y A V R D f I A L» C T A E S I'D' T L R A 207 

TAT GCA GTA CGT GAT ACC ATT GCA CTG TGT ACA GCA GAA TCC ATC GAT ACC CTC CGA GCA 621 

A I E C EQ P Q P D L Y K F VG R I N I 227 

GCA ATT GAA TGT GAA CAG CCT CAA CCT GAC CTC TAC AAA TTT GTT GGG CGA ATC AAT ATC 681 

y S s L E A V A R S L G P E N L L L» K 247 

TAC AGT AAT AGT CTT GAG GCT GTT GCC AGG TCT TTG GGA CCT GAA AAT CTC TTG CTG AAA 741 

G A T L K N T E K I Y G V A V Y T G M E 267 

GGA GCT ACG CTA AAA AAT ACC GAG AAG ATA TAT GGA GTT GCT GTT TAC ACT GGA ATG GAA 801 
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T K M A L N Y Q G K S Q K ' R ' S A V E K S 287 
ACC AAA ATG GCT TTG AAC TAC CAA GGG AAA TCT CAG AAA' CGT TCT GCT GTT GAA AAA TCT 861 

I N A F L.I V V L F I L I* T K AAV C T 307 
ATT AAT GCT TTG CTG ATT GTA TAT TTA TTT »ATC TTA CTG ACC AAA GCT GCA GTA TGC ACT 921 

.?* • , 

T L K. Y V W Q S- T P Y N , D E »P W Y N ' Q K 327 
ACT CTA AAG TAT GTT TGG CAA AGT ACC CCA TAC AAT GAT GAA CCT TGG TAT AAC CAA AAG 981 

T O K E R E T L ( K V L K M F T D F L S F 347 

ACT CAG AAA GAG CGA GAG ACC TTG AAG GTT TTA AAA ATG TTC ACC 6AC TTC CTA TCA TTT 1041 

I ■ ' ' I. • . 

M y L F N F I I • 'P V S M Y V T V E M Q K 367 

ATG GTT CTA TTC AAC- TTT ATC ATT CCT GTC TCC ATG TAC GTC ACA GTA GAA ATG CAG AAA 1101 

F L G S F F I S W D K D F Y D E E I N E 387 

TT*C tTG GGC TCC TTC TTC ATC TCA TGG GAT AAG GAC TTT TAT GAT GAA GAA ATT AAT GAA 1161 

G A LV N T S D L N E E L G Q V D Y V F 407 

GGA GCC CTG GTT AAC ACA TCA GAC CTT AAT GAA GAA CTT GGT CAG GTG GAT TAT GTA TTT 1221 

T D K T G T Li T ENS M E F I E C C I D 427 

ACA GAT AAG ACT GGA ACA CTC ACT GAA AAC AGC ATG GAA TTC ^ ATT GAA TGC TGC ATA GAT 1281 

G H K y K G V T Q E V D G L S Q T D G T 447 

GGC CAC AAA TAT AAA GGT GTA ACT CAA GAG GTT GAT GGA TTA TCT CAA ACT GAT GGA ACT 1341 

L T Y F D K V D K N R E E L F L R A L C 467 

TTA ACA TAT TTT GAC AAA GTA GAT AAG AAT CGA GAA GAG CTG TTT CTA CGT GCC TTG TGT 14 01 

L C H T V E I K T N D A V D G A T E S A 487 

TTA TGT CAT ACT GTA GAA ATC AAA ACA AAC GAT GCT GTT GAT GGA GCT ACA GAA TCA GCT 14 61 

E L T y I S S S P D E I A h V K G A K R 507 

GAA TTA ACC TAT ATC TCC TCT TCA CCA GAT GAA ATA GCT TTG GTG AAA GGA GCT AAA AGG 1521 

Y G F T F L G N R N G Y M R V E N Q R K 527 
TAC GGG TTC ACA TTT TTA GGA AAT CGA AAT GGA TAT ATG AGA GTA GAG AAC CAA AGA AAA 1581 

E I E £ Y E L L H T L N F D A V R R R M 547 

GAA ATA GAA GAA TAT GAA CTT CTT CAC ACC TTA AAC TTT GAT GCT GTC CGG CGA CGT ATG 1641 

S V ' I- V K T Q E G D I L X. F C K G A D S 567 

AGT GTA ATT GTG AAG ACT CAA GAA GGA GAC ATA CTT CTC TTT TGT AAA GGA GCA GAC TCG 1701 

A y F P R V Q N H E I E L T K V H V E R 587 

GCA GTT TTT CCC AGA GTG CAA AAT CAT GAA ATT GAG TTA ACT AAA GTC CAT GTG GAA CGT 1761 

M A M D G Y R T L C V A F K E I A P D D 607 

AAT GCA ATG GAT GGG TAT CGG ACA CTC TGT GTA GCC TTC AAA GAA ATT GCT CCA GAT GAT 1821 

Y E R I N R Q L I E A K M A L Q. D R E E 627 
TAT GAA AGA ATT AAC AGA CAG CTC ATA GAG GCA AAA ATG GCC TTA CAA GAC AGA GAA GAA 1881 

K M E K V F DDI E T N M N L I G A T A 647 

AAA ATG GAA AAA GTT TTC GAT GAT ATT GAG ACA AAC ATG AAT TTA ATT GGA GCC ACT GCA 1941 
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V E D K L Q D Q A A » E T I E A L H A A G 667 

GTT GAA GAC AAG CTA CAA GAT CAA GCT GCA GAG ACC ATT GAA GCT CTG CAT GCA GCA GGC 2001 

L K V W V L T G D K M E T A K S T C Y A 687 

CTG AAA GTC TGG GTG CTC ACT GGG GAC AAG ATG GAG ACA GCT AAA TCC ACA TGC TAT GCC 2061 

C R L F Q T N T E L L E L T T K T I ' E E 707 

TGC CGC CTT TTC CAG ACC AAC ACf GAG CTC TTA GAA CTA ACC ACA AAA ACC ATT GAA • GAA ^ 2121 

i 

• S B R K E, D R L H E L L I E Y R K K L L 727 

AGT GAA AGG AAA GAA* GAT CGA TTA CAT GAA TTA TTG ATA GAA TAT CGC AAG AAA TTG CTG 2181 

A . • . _ 

H E F P K ' S » T . R 'S F K K A W T E H Q E Y 747 

CAT GAG TTT CCT AAA AGT ACT AG A AGC TTT AAA AAA GCA TGG ACA GAA CAT CAG GAA TAT 2241 

G L I * I D G S T L S h I L N S S Q D S S 767 

GGA TTA ATC ATA GAT GGC TCC ACA TTG TCA CTC ATA .CTA AAT TCT AGT CAA GAC TCT AGT 2301 

■ I 

S.N N Y K S I F L Q I , C M K C T A V L» C 787 

TCA AAC AAT TAC AAA AGC ATT TTC CTA CAA ATA TGT ATG AAG TGT ACT GCA GTG CTC TGC 2361 

I * . 

C R M A P L Q K A Q I , V R.M V» K N L K G 807 

TGT CGG ATG GCA qCA TTA CAG AAA GCC CAG ATT GTC AGA ATG GTG AAG AAT TTA AAA GGC 2421 

S P I T L S I G D G • A N D V S M I L E S 827 

AGC CCA ATA ACT CTG TCG ATA GGT GAT GGT GCC AAT GAT GTT AGT ATG ATC TTG GAA TCC 2481 

H VGIGIKGK E G R Q A A* R N S D Y 847 

CAT GTG GGA ATA GGT ATT AAA GGC AAA GAA GGT CGC CAA GCA GCT AGG AAT AGC GAT TAT 2541 

S V P K F K H L K K W L L A H G H * L Y Y 867 

TCT GTT CCA AAG TTT AAA CAC TTA AAG AAA CTG CTG TTG GCT CAT GGA CAT CTA TAT TAT 2601 

V R I A H . L V Q Y F F Y K N L C F I L P 887 

GTG AGA ATA GCA CAC CTT GTA CAG TAC TTC TTC TAT AAG AAC CTT TGT TTC ATT TTG CCA 2 661 

q p L Y Q F F C G F S * Q Q P L Y D A A Y 907 

CAG TTT TTG TAC CAG TTC TTC TGT GGA TTC TCA CAA CAG CCA CTG TAT GAT GCT GCT TAC 2721 

L T M Y N I C F T S L P I L A Y S L L E 927 

CTT ACA ATG TAC AAT ATC TGC TTC ACA TCC TTG CCC ATG CTG GCC TAT AGT CTA CTG GAA 27 81 

QHINIDT LTSD P R LYMKI S G 947 

CAG CAC ATC AAC ATT GAC ACT CTG ACC TCA GAT CCC CGA TTG TAT ATG AAA ATT TCT GGC 2841 

N AM- L Q L G P F L Y W T F L A A F E G 967 

?lAT GCC ATG CTA CAG TTG GGC CCC TTC TTA TAT TGG ACA TTT CTG GCT GCC TTT GAA GGG 2901 

T v F F F G T Y F L F Q T A S LEE N G 987 

ACA GTG TTC TTC TTT GGG ACT TAC TTT CTT TTT CAG ACT GCA TCC CTA GAA GAA AAT GGA 2961 

K VY G NW T F G T I V FT V L VF T V 1007 

AAG GTA TAC GGA AAC TGG ACT TTT GGA ACC ATT GTT TTT ACA GTC TTA GTA TTC ACT GTA 3021 

T L k L A L D T R F W T W I N H F V I W 1027 

ACC CTG AAG CTT GCC TTG GAT ACC CGA TTC TGG ACG TGG ATA AAT CAC TTT GTG ATT TGG 3081 
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G S L, A F Y V F F S F F W G G ' I I W P F . 1047 
GOT TCT TTA GCC TTC TAT GTA TTT TTC TCA TTC TTC TGG GGA GGA ATT ATT TQG CCT TTT 3141 

L K Q 0 R M Y F V F A Q m'.L S S V S T W 1067 
CTC AAG CAA CAG AGA ATG TAT TTT GTA TTT GCC CAA ATG CTG TCT TCT GTA TCC ACA TGG 3201 

L A 11 L L I F I S ' U F P ,E I ; L L I V . L 1087 

TTG GCT ATA ATr CTT CTA ATA' TTT ATC AGC CTG TTC CCT GAG ATT CTT CTG ATA GTA TTA 3261 

I 1 

K N V R R R S A R R N L S C R R ' A S D S 1107 

AAG AAT GTA AGA AGA AGA AGT GCC AGG AGA AAT CTG AGC TGT AGA AGG GCA TCT GAC TCA 3 321 

l • 

L S A R P S V R P L L L R /T F S D E S N 1127 

TTA! TCC GCC AGA CCT TCA GTC AGA CCT CTT CTT TTA CGA ACA TTC TCA GAC GAA TCT AAT 3381 

i ...» • 

v L ♦ - 1130 

GTA TTG TAA 3390 

CAGAATCCGAATCTTGAACTGCCTATGTTATTGTCCTACA 

AAGAAACAACTACAAAAAGTTATCATCTCAGGATACTTGATATGCAACACACTAAACCACrc 
AATAAATGTTCATTAAAATACCAAATGATTCTCTTAAGCATTTACCA 

aagi^aagaattatatgaaagttgaaack:aagaatacttagaat^ 

TGCTCTTTTAACCCATGAACTTTGTGAATGGATTTAAATAC 

G ATTTTGCTAATAC TACTTTTTTTGCCTGGC AG AAG AAATAG AC TATTTGG ATCAC ATTTCTC ATTCCTCCTAAATG AT 
CATCTTAATTTTTTTTCCCAAGTACATAAGGAAT^^ 

AATAGTTAATCCCTTCTGTTTACCCATGTGCTACTAATGTCTTGGTAGA^^ 

TTATGTGGAAAGTGTTAACTTACGGGTATTTTTGTGGGAATA 

CCCCACTTATGGGTGTAAGCCTACTAGACTTGAAAATAAAGTATAAAACATTTC 

AGTTAGAAAATAAACAGATTTTTCCAGTGTTGATTTTAC 

AAATAAAGGTCATTCTGAATATCAGCCTTTTATAATTTTATC 

CGGTTTTTATTTGAAAGAGATTGCATTTATGCATATATGCAGTGCT 

AGGAGCCACCfcCWU^^ 

GG AAATGGTCTAACTCTAAACCATGTAACTGACCTTAGTAAAGTCPTTGACTAACTGAA 

TCTAATTAGTTC^CTTGAAACATAAATGTGAAA 

C ATATTTATTTG ACTGCTAGTTTTTTTGTT^^ 

TACCTTGATTTGGAAAAGTATTGGAGTTAATCTGTATTATAT^ 

ATATTTTGTGTTAATGTTTAGGTATGATTTTTTTC 

CATCATTATAGACCCrrTTTTCATTATTTCATTT^ 
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tcacaacttacctaagtgtgctgtgttctggtagcccgtatttg 
ttgcctattccAaagagctaaaaaagtctaacccaggaaagcttttgat^ 

■ - I t • -• ■ 

TGTTGTTGCTGTATTATGATTGCTCTTTTACATAAAAT^ 

AGGCTTGTTTAATGCAGTACCATTGGAGAGTTAACAGAATAAT 

• • ' ■ • 

TCCAGCCAGAAAGAAAGAAAGACAAGGAGTAAGGGGGATTTAGAGT^ 
CAGCTCAAATTCAGAATGGCAATGATACATGATATCATG 

GATATTTTACTGAAGAGCTAuAAAACTGGCCAGTGTGGGGTTAGCAGATCGAATAACTTGAAATAGACCGTC 

CTAGdACjTCAATGTAATCACCCTATTTGTGACApAGAAAGGGAAAAAAATATAATAAGA 

TAATTTTGAGCTATCAAAATGTCTTTGTAATTTTCACAACCGCTC 

AACATTCATTCCATATCTACTTACACATACACCAGCAAC AGTATAAATGTAAGCCTAACTTTGCAAAATTCGTAATAAT 

TTAGTGATGGAATTTTTTAATAACATGCAGTATATAAATGTGCAGATTT^ 

TGCAAAATGGGACTGCAATATTACATTTTTCACTTAAGCAGTTT^ 

GAATGCCATCTTTTATGACTGCAACTTGCCTTT^ 

GATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Protein Family / Domain Matches, HMMer version 2 
Searching for complete domains in PFAM 

hmmpfam - search a single seij against HfoM database ' 
HMMER 2.1.1 (Dec 1998) 

Copyright <C) 199-2-1998 Washington University School of Medicine r 
HMMER is freely distributed under, the iCNU General Public License (GPL) 



HMM file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pf am6 . 4 /Pf am 
/prod/ddm/wspace/orf anal /oa- script . 13758 . seq 



Query: 67076 ( ' 

Scores for sequence family plassif idat ion (score includes all domains) ■ 
Model Description . Score E-value N 



Hydrolase ( haloacid dehalogenase-like hydrolase 



Parsed for domains: 

Model Domain seq-f seq-t 



Hydrolase l/i 



403 



837 



hmm- f hmm- t 

1 184 [I. 



12.7 



score E-value 



0.019 



12 .7 



0.019 



Alignments of top- scoring domains: ' , 

Hydrolase: domain 1 of l, from 403 to 837: score 12.7, E = 0.019 

* - >ikavvF©kDGTLtdgkeppiaeaivealrelgl 

++ vV Dk+GTLt+ + , e + + + + +g+4 J+ + ++■++++++" 
67076 , 403 VDYVFTDKTGTLTEN-SMEFIECC IDGHKYKGVtqevdglsqtdgtl 448 

.apleevekllgrg"! .gerilleggltaell .......... 

+ + + + el +r+l ++++ t +++++ +++ + + ++++ 

67076 449 tyf dKVDKNREELFLRALcL- -CHTVEIKTNDAVdgatesael tyisssp 496 



++. ++■ ++ + + ++++ + + ++ +++ ++ + ++ + + uf + + 
67076 497 deialvkgakrygf tf lgnrngymrvenqrkeieeyellhtlnf davrrr 546 



67076 



+ ++ +++ ++ ++ ++ +++ + ++ + +++ ++ ++ 

54 7 msvivktqegdillfckgadsavfprvqnheieltkvhvernamdgyrtl 596 



67076 



67076 



67076 



• ; • ; .ld.evlgli 

+ + +++++ ++ ++ +++ + +g + 

597 cvafkeiapddyerinrqlieakmalqdreekmekvfddieTNmNLIGAT 646 



++ +++ ++ . ++ 



al . dklypgarealkaLkerGikvailTngdr . naeal le algla 

a++dkl + a+e+++aL+++G+kv++lT++ ++a+ + +++ 

647 AVeDKLQDQAAET I EALHAAG^KVWVLTGDKMeTAKSTC Yac r 1 f QTNTE 696 

. lfdaivdsdevggvgpvwgKPkpeif llalerlgvkpeevg ....... 

1 +++e + k + ++++1 ++++k ++ +++++ 

697 1LELTTKTIEESE RKEDRLHELLIEYRKKLL- -Hef pkstr 735 



+ ++ +++++ + ++++ + +++ + ++++++ ++ + + + 
67076 736 sf kkawtehqeygliidgstlslilnssqdsssnnyksif Iqicmkctav 785 

4 .... .p.kvlmvGDginDapalaaAGvgvamgn 

+ + ++ + + ++ +GDg nD+ ++ +vg+ + 

67076 786 1 ccrmap 1 qkaqi vrmvknlkgSp I TLSIGDGANDVSM I LESHVGIG I KG 835 

gg<-* 

67076 836 KE 837 
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USTAL W (I-.74) multiple sequence alignment 



h67076FL 
useATlH 



h67076FL 
useATlH 



h6707SFL 
useATlH 



.h67076FL 
aUseATlH 



,h67076FL 
>useATlH 



>h67076FL 
mseATlH 



>h67076FLi 
>useATiH 



>h67076FL 
DUseATlH 



Dh67076FL 
^useATlH 

Dh67076FL 
ouseATlH 



bhS7076FL, 
ouseATlH 



bh67076FL 
OuseATlH 



t)h67d76Flj 
tOUSeATlH 



*bh67076FL 
touseATlH 

•"bh6707 6FL 



-MFRRSLNRFCAGEEKRVGTRTVFVGN -HPVSETEAY IAQRFGDNRI VSSKYTLWNF 

" - -n-v\ . -r/^-u : , 

T lPKNLFEQFRRlANFYFLh : I FLVQVTVDtTPTS PVTSGLPLFFVT TVTAI WQG Y ED WLRHR 
.********»************** : ;************************** "," w " : 
ADNEVNKSTVYI I ENAKRVRKESEK1 KVGDWEVQADETF PCDL I LLSSCTTDGTCY VT© 

JESSES^ 

* *- * , * i i • * ***;*.*?• • * " • ■ — s\ 

A <^T^SeSIJCKTHYAVRDTI ALCTAES I DTLRAAI ECEQ PQ PDI/YKFVGRI NI Y SN S LEA V 

JIMSesshkthyav^ 




TP-J .. 

T piLLTKAAVCTTLKYAmb sTPYWDEPVm^QKTQKERETLKVLKMFTD^ I 




** **::*:: . • . - ... 



j^ii^iSGi^^ 
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useATlH 



h67076FL, 
useATlH 



h67076FL 
useATlH 



h67076FL 
useATlH 



•h6 7076FL 
»useATlH 



ILEAHVGIGVIGKEGRQAAR^SDYAIPKFKHLKJWLLVHGH 

** * .*****. ************* *****'*** : ** »** V ; ** : * * : m ********* . * 



" FI LPQFLYQI^FGGFSQQPLYE^AYLTMYNICFTSLPI LA YSLI^SQHI N I DTLTS DPRL, YM 

rDT tAYLTLYNISFTSLPILLYSI> 1EC 
* * * - * 



F I F PQFL YQ F] FCG PSQQTL YDT lA Y LTL YN I S FTS L P I L L YSIr lEQHVG I D VLKRD PTL YR 
** : * ************* _*** : **** : ***^ ******* ***:***:.**.*. * * ** 



ki sgnamlqlgi/flywtflaaf 

DI AKNALLRWR\ 1 1 YWTFLiGVFDALVFFFG AYF ijFENTTVT I NGQMFGkJWTFGTT . VPfx^ 



* . * * . * 



******* .'.*-. *****•**;* 



** . . ;******* . * * * * . 



VFTVTL^LAIjDTRFWTW INHFVIWGSLAFYVFFSFFWG 31 IWPFLiKQQRMYFVFAQMLSS 
VLTVTLKLALDfr HYWTW tl^FVIWGSLL^ 

a.**********..************* **. .'**'. -*•**•♦**** T ****:** a **** 

^ Tr^\* ' » 

VSTWIiAIILLIFISLFP£:i LLI>7l) K -NVRRRSARRNLSCRRASDSLSAR 

GPAWLgllXiIjVTVG^ 

.****** . . ^ * : * : : * .*★•/. ::: : *..*:" 



»h67076FL PSVRPLLL^TFSDESNVL- 

>UseATlH GSRLLAAOCSSPSGRVVCSRWESEECPVLPLHPGLPHKARYGCCRSSLEMPT 

* * * 



* * • * * * 
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. . . 1 • • s , r 

Input file Fbh67102FL.seq; Output File Fbh6,7102FL. tra 

Sequence length* 6074 

CCACGCGTCCGGGAGGAGCGGAGGGAGAAGTAGGTTGCGAGCTCAGCACAGGCTTC^ 
TTGGGAGATGTCTAAGTGATTTTTTTTTTTTCCCGGAAGG 

CG AATTTGTGCTTAGCTCTTTTCTTGTACCTTGCG actcg^ accaac atgctgtgatgtgtgccgagggaggaattcjg 

I I .... . , 

M T E A L Q WARY ' 10 

TCAGCTACACAACCTGGATCtTACCACAGTTTGGAT ATG ACT GAG GCT CTC CAA TGG GCC AG A TAT 30 
I ' 

H W R R L I ,R G A T R D D D S G P Y N Y 30 

CAC TGG CGA CGG CTG ATC AG A GGT GCA ACC AGG GAT GAT GAT TCA GGG CCA TAC AAC TAT 90 

S S L I A C G R K S S. Q I P K L» S G R H 50 

TCC TCG TTG CTC GCC TGT GGG CGC AAG TCC TCT CAG ATC CCT AAA CTG TCA GGA AGG CAC ' 150 

R I V V P H I Q P F K * D E Y E k! F S G A 70 

CGG ATT GTT GTT CCC CAC ATC CAG CCC TTC AAG GAT GAG TAT GAG AAG TTC TCC GGA GCC 210 

y v N N R I R T'T K Y T L L N ( F.V P R N • 90 

TAT GTG AAC AAT CGA ATA CGA ACA ACA AAG TAC ACA CTT CTG AAT TTT GTG CCA» AGA AAT 270 

i ■ . 

b F E Q F H R A A S ,L, Y F L F L V V L N 110 

TTA TTT GAA CAA TTT CAC AGA GCT GCC AGT TTA TAT TTC CTG TTC CTA GTT GTC CTG AAC 330 

W V P L V E A F Q K E 1 T M L P L V V V 130 

TGG GTA CCT TTG GTA GAA GCC TTC CAA AAG GAA ATC ACC ATG TTG CCT CTG GTG GTG GTC 390 

b T I I A I K D G L. E ,D Y R K Y K I D K 150 

CTT ACA ATT ATC GCA ATT AAA GAT GGC CTG GAA GAT. TAT CGG AAA TAC AAA ATT GAC AAA 4 50 

Q I N N L I T K V Y S R K E K K Y I D R 170 

CAG ATC AAT AAT TTA ATA ACT AAA GTT TAT AGT AGG AAA GAG AAA AAA TAC ATT GAC CGA 510 

C W K D V T V G D F I .R I* S C N E V I P 190 

TGC TGG AAA GAC GTT ACT GTT GGG GAC TTT ATT. CGC CTC TCC TGC AAT GAG GTC ATC CCT 570 

A D M V t» h F S T D P D G I C H I E T S 210 

GCA GAC ATG GTA CTA CTC TTT TCC ACT GAT CCA GAT GGA ATC TGT CAC ATT GAG ACT TCT 630 

G L D G E S N L K Q R Q V V R G Y A E Q 230 

GGT CTT GAT GGA GAG AGC AAT TTA AAA CAG AGG CAG GTG GTT CGG GGA TAT GCA GAA CAG 690 

D S E V D P E K F S S R I E C E S P N N 250 

GAC TCT GAA GTT GAT CCT GAG AAG TTT TCC AGT AGG ATA GAA TGT GAA AGC CCA AAC AAT 7 50 

DLSRF RG FLEHS NKERVG LS 270 

GAC CTC AGC AGA TTC CGA GGC TTC CTA GAA CAT TCC AAC AAA GAA CGC GTG GGT CTC AGT 810 

K E N L 1# 1* R G C T I R N T E A V V G I 290 

AAA GAA AAT TTG TTG CTT AGA GGA TGC ACC ATT AGA AAC ACA GAG GCT GTT GTG GGC ATT 870 

V V Y A G H E T K A M L N N S G P R Y K 310 

GTG GTT TAT GCA GGC CAT GAA ACC AAA GCA ATG CTG AAC AAC AGT GGG CCA CGG TAT AAG 930 
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R S K ' V E R R A N T D V L W C V M L L V 330 

CGC AGC AAA TTA GAA AGA AGA GCA AAC ACA GAT GTC CTC'TGO TGT GTC ATG CTT CTG GTC 990 

I M C L T G A V G H G I W L S R Y E K M 350 

ATA ATG TGC TTA ACT GGC GCA GTA GGT CAT GGA ATC TGG CTG AGC AGG TAT GAA AAG ATG 1050 

H F F N V P E » P D G . H I I S P ' h L A G F 370 

CAT TTT TTC AAT GTT CCC GAG CCT GAT GGA CAT ATC ATA TCA 'CCA CTG TTG GCA GGA TTT 1110 

Y M F W T M I I L L Q V L I PIS L ■ Y V 390 

TAT ATG TTT TGG ACC ATG ATC ATT, TTG TTA CAG GTC TTG ATT CCT, ATT TCT CTC TAT GTT 1170 

S »I ( E . I . V K L G Q I Y F I Q S D V D F Y » 410 

TCC ATC GAA ATT GTG AAG CTT GGA CAA ATA TAT TTC ATT CAA AGT GAT GTG GAT TTC TAC 123 0 

N E K M D S I V Q C R A L N I A E D L G 430 

AAT iGAA AAA ATG GAT TCT ATT GTT CAG TGC CGA GCC CTG AAC ATC GCC GAG GAT CTG GGA 1290 

Q 10 Y L F S D K T G T L T E N KM V F 450 
CAG ATT CAG TAC CTC TTT TCC GAT AAG ACA GGA A(?C CTC ACT GAG AAT AAG ATG GTT TTT- 1350 

R R C S V A G F D Y C H E E N A R R L E 470 

CGA AGA TGT AGT GTG GCA GGA TTT GAT TAC TGC CAT GAA GAA AAT GCC AGG AGG TTG GAG 1410 

S Y Q E A V S E D E D F I D T V S G S L» 490 

TCC TAT CAG GAA GCT GTC TCT GAA GAT GAA GAT TTT ATA GAC ACA GTC AGT GGT TCC CTC 1470 

S N M A K P R A P S C R T V H N G P L . G 510 

AGC AAT ATG GCA AAA CCG AGA GCC CCC AGC TGC AGG ACA GTT CAT AAT GGG CCT TTG GGA 153 0 

N K P S N H L A G S S F T L G S G E G A 530 

AAT AAG CCC TCA AAT CAT CTT GCT GGG AGC TCT TTT ACT CTA GGA AGT GGA GAA GGA GCC 1590 

S E V P H S R Q A A F S S P I E T D V V 550 

AGT GAA GTG CCT CAT TCC AGA CAG GCT GCT TTC AGT AGC CCC ATT GAA ACA GAC GTG GTA 1650 

P D T R L L D K F S Q I T P R L F M P L» 570 

CCA GAC ACC AGG CTT TTA GAC AAA TTT AGT CAG ATT ACA CCT CGG CTC TTT ATG CCA CTA 1710 

D E T I Q N P P M E T L Y.I I D FF I A 590 

GAT GAG ACC ATC CAA AAT CCA CCA ATG GAA ACT TTG TAC ATT ATC GAC TTT TTC ATT GCA 1770 

L A I C N T V V V S A P N Q P R Q K I R 610 

TTG GCA ATT TGC AAC ACA GTA GTG GTT TCT GCT CCT AAC CAA CCC CGA CAA AAG ATC AGA 1830 

H P S L. G G L P I K S L E E I K S L F Q 630 

CAC CCT TCA CTG GGG GGG TTG CCC ATT AAG TCT TTG GAA GAG ATT AAA AGT CTT TTC CAG 1890 

R W S V R R S S S P S L N S G K E P S S 650 

AGA TGG TCT GTC CGA AGA TCA AGT TCT CCA TCG CTT AAC AGT GGG AAA GAG CCA TCT TCT 1950 

G V P N A F V S R L P L F S R M K P A S 670 

GGA GTT CCA AAC GCC TTT GTG AGC AGA CTC CCT CTC TTT AGT CGA ATG AAA CCA GCT TCA 2010 

P V E E E V S Q V C E S P Q C S S S S A 690 

CCT GTG GAG GAA GAG GTC TCC CAG GTG TGT GAG AGC CCC CAG TGC TCC AGT AGT TCA GCT 2070 
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TGC TGC ACA GAA ACA GAG AAA CAA CAC GGT GAT GCA. GGC CTC CTG AAT GGC AAG GCA GAG 2130 
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K A G I K I W M L T G D K Q E T A V N I- 930 

AAA GCG GGC ATC AAG ATC TGG ATG CTG ACA GGG GAC AAG CAG GAG ACA GCT GTC AAC ATA 2790 

a v « r K L It E P D D K L V • I L N T Q S 950 

GCT TAT GCA TGC AAA CTA CTG GAG CCA GAT GAC AAG CTT TTT ATC CTC AAT ACC CAA AGT 2850 

«r n a C O M L M S T I L K E L Q K K T Q 970 

AAA GAT GCC TGT GGG ATG CTG ATG AGC ACA ATT TTG AAA GAA CTT CAG AAG AAA ACT CAA 2910 

A r, P E Q V S L S E DLL Q P P V P R D 990 

GCC CTG CCA GAG CAA GTG TCA TTA AGT GAA GAT TTA CTT CAG CCT CCT GTC CCC CGG GAC 2970 

A G L I 1 T G K T L E F A L Q E 1010 

TCA GGG TTA CGA GCT GGA CTC ATT ATC ACT GGG AAG ACC CTG GAG TTT GCC CTG CAA GAA 3030 

<- , o K a F L E L T S W C Q A V V C C R 1030 

AGT CTG CAA AAG CAG TTC CTG GAA CTG ACA TCT TGG TGT CAA GCT GTG GTC TGC TGC CGA 3090 

O K S E V V K b V R S H L Q V M 1050 

GCC ACA CCG CTG CAG AAA AGT GAA GTG GTG AAA TTG GTC CGC AGC CAT CTC CAG GTG ATG 3150 

AC^ CTT GCT ATT GGT gSt GGT GC*C AAT GAT GTT AGC a5g ATA CAA GTG gJa GAC ATT GGG 3210 
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I G V S G Q E G M Q A * V M A S D F A V . S 1090 

ATA GGG GTC TCa'gGT CAA GAA GGC ATG CAG GCT GTG ATG GCC ACT GAC TTT GCC GTT TCT 3270 

Q F K H L S K L L L V H G H W C Y T R L 1110 

CAG TTC AAA CAT CTC AGC AAG CTC CTT CTT GTC CAT GGA CAC TGG TGT TAT ACA CGG CTT 333 0 

S N M I L Y F F Y K N V A Y* V N L L *F W 1130 

TCC AAC ATG ATT CTC TAT TTT TTC TAT AAG ,AAT GTG QdQ TAT GTG AAC CTC CTT TTC TGG 3390 

I ■ 

i . 

Y O F F C ,G F S G T S M T D Y W V L I F 1150 
TAC CAG TTC TTT TGT GGA TTT TCA GGA ACA TCC ATG ACT GAT TAC TGG GTT TTG .ATC TTC 34 50 

I 

F N L L F TiS A P P V I Y G. V L E K D V 1170 

TTC AAC CTC CTC . TTC ACA TCT GCC CCT CCT GTC ATT TAT .GGT GTT TTG GAG AAA GAT GTG 3 510 

S A E T L M Q L P E Li Y R S G Q K S E A, 1190 

TCT GCA GAG ACC CTC ATG CAA CTG CCT GAA CTT TAC AGA AGT GGT CAG AAA TCA GAG GCA 3570 

• I 

Y L P H T F W I T L L ,D A F Y Q S L V C 1210 
TAC TTA CCC CAT ACC TTC TGG ATC ACC : TTA TTG GAT GCT TTT TAT CAA AGC CTG GTC TGC 3 630 

t * 

F F V P Y F T Y\ Q G S D T D I i F A F G N 1230 

TTC TTT GTG CCT TA? TTT ACC TAC CAG GGC TCA GAT ACT GAC ATC TTT GCA TTT GGA AAC 3 690 

p l N T A A L# F I V'L 1* H L V I E S K S 1250 

CCC CTG AAC ACA GCC GCT CTG TTC ATC GTT CTC CTC CAT CTG GTC ATT GAA AGC AAG AGT 3750 

L T W I H L L V I I ... G S I L S Y F L F A 1270 

TTG ACT TGG ATT CAC TTG CTG GTC ATC ATT GGT AGC ATC TTG TCT TAT TTT TTA TTT GCC 3 810 

I V F G A M C V T C N' P P S N P Y W.I M 1290 

ATA GTT TTT GGA GCC ATG TGT GTA ACT TGC AAC CCA CCA TCC AAC CCT TAC TGG ATT ATG 3870 

q E H M L D P V F Y L V C I L T T S I A 1310 

CAG GAG CAC ATG CTG GAT CCA GTA TTC TAC TTA GTT TGT ATC CTC ACG ACG TCC ATT GCT 3 930 

L L P R F V Y R V L Q G S h F PS P I L 1330 

CTT CTG CCC AGG TTT GTA TAC AGA GTT CTT CAG GGA TCC CTG TTT CCA TCT CCA ATT CTG 3 990 

R A K H F D R L T P E E R T K A L K K W 1350 

AGA GCT AAG CAC TTT GAC AGA CTA ACT CCA GAG GAG AGG ACT AAA GCT CTC AAG AAG TGG 4 050 

R G A G K M N Q V T S K Y A N Q S A G K 1370 

AGA GGG GCT GGA AAC3 ATG AAT CAA GTG ACA TCA AAG TAT GCT AAC CAA TCA GCT GGC AAG 4110 

S G R R P M P G P S A V F A M K S A T S 1390 

TCA GGA AGA AGA CCC ATG CCT GGC CCT TCT GCT GTA TTT GCA ATG AAG TCA GCA ACT TCC 417 0 

C A I E O G N L S L C E T A L D Q G Y S 1410 

TGT GCT ATT GAG CAA GGA AAC TTA TCT CTG TGT GAA ACT GCT TTA GAT CAA GGC TAC TCT 4230 

E T K A F EM A G P S K G K E S * 1427 

GAA ACT AAG GCC TTT GAG ATG GCT GGA CCC TCC AAA GGT AAA GAA AGC TAG 4281 

ATACCCTCCTTGGAGTTGCAAGTATTCTTTCAAGGTTC 

CTTGTTTTTCC ATAAGGG AC ATG AGC ATTTTACTAGGCTTGG AAG AG CTG ACATG ATGAGC ATT ATTGTATGTTTGTAT 
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atacatttgtgatagagggctagagtttgacctagagagagt^ . 

tttgtaaaacttVitcgattttgtaaaagcattttc 

atatatttattttactaggagatcttatattc^ 

aaaaaagtagtttttaatacattggttaggactcag 

i , ■ . 

AAATCCCAAGAGCCTTTTAAACAACAAGGTACCTAAAATAGGG 

TAATAGCTTTTTATTTCCTATGGGAAGATGCTTTTGGTCTTC 

TGTTTTCCTGAGGTGGAGCCTTCATTGGAAAdGGGAAAGAGGGATTC 

TCTGTCAGGTTCCAATCAAGAGAAGACCTTTTATGAGATCTGCCTCTGT^ 1 

ATAGGTCAGGCAGACATCAGCTCAGCCTGTGGCCCATTGGGTGATT^ 

AAGTbATACAATCAATTTCAAAACAATCTTCCAGAGACCACTTGAAGGTTCA 

CAGGTGTTGGAGCCTCTAAAATATGAGATATAAACAGAAACTAATACAAGTTGTTCTC 

TAGAAAAATTTGGTTTTAAAATCATTTGAGGACAGGAATC 

GGCTTTAGTTGTAACAAACGATTTTATTCTM^ 

TTTGAAGTCCTATCTCn s ATTTATTATATTT 

GCTCATATGCAATGTCTACATCAAGGTCTTCTTAATGACTATTATTCTCAG 
TTTTGGTCTGACATTTTTGTAGCCTTCTGTTATTATTGGAAATAGTC 
ATCTCACATAGCTAATGGAAGTTGCTTTCTGCTTTCTTATGA 
AAAAAAAAAGGGCGGCCGC 
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Protein Family / Domain Matches, HMMer version 2 

Searching for complete* dqmains in PFAM ." , 

hmmpf am - search a Single seq against HMM database ' . 

HMMER 2.1.1 (Dec 1998) • 
Copyright (C) 1992-19*98 Washington University School of Medicine 1 r 
HMMER is freely distributed under the GNU General Public License (GPL). 

HMM file: /prod/ddm/seqanal/PFAM/pfam6.4/Pfam 
Sequence file: /prod/ddm/wspace/orf anal/oa -script . 14482 . seq 

Query: 67102 • 

v • , 

Scores for sequence family classification (score includes all * domains) : J 
Model Description Score E-value N 

' _ _ _ --.I _ _ _ 

Hydrolase ,haloacid dehalogenase-like hydrolase ' 1.5 0.17 

1 ' 

DUF6 Integral membrane protein DUF6 -24.6 9.4 

1 

* ■ * 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score Envalue 



Hydrolase l/i 432 1077 1 184 [] 1.5 0.17 

DUF6 1/1 1127 1^71*:. 1 126 '(] -24.6 9.4 



I 



Alignments of top-scoring domains: 

Hydrolase: domain 1 of 1, from 432 to 1077: score 1.5, E *= 0.17 

*->ikavvFDkDGTLtdgkeppiaeaivealrelgl . . . .apleevekll 
i ++ Dk+GTLt+ + 4- ++ v+ ++ +++++++ e+ 
67102 432 IQYLFSDKTGTLTEN- KMVFRRCSVAGFDYCHEenarRLESYQEAVS 477 

grgl .g. . erilleggltaell . . . . 

+ + +++ ++1 + +++ ++,+ ++ +++ + •*; + + + ++ + +++ + 
67102 478 EDEDf IdtVSGSLSNMAKPRAPscrtvhngplgnkpsnhlagssf tlgsg 527 



+ + ++ ++++ +++ + + + ++*+ ++ + +++ + + + + ++ 
67102 52 8 egasevphsrqaaf sspietdwpdtrlldkf sqitprlf mpldet iqnp 577 



+ ++ + ++ + ++ ++ + ++++ ++ + ++ ++ ++ 

67102 578 pmetlyiidff ialaicntvwsapnqprqkirhpslgglpiksleeiks 627 



+ + +++++++ +++++++++ ++ ++ + ++ ++ ++ +++ + 
67102 62 8 lfqrwsvrrssspslnsgkepssgvpnafvsrlplf srmkpaspveeevs 677 



+++ ++++ +++++ +++ + + + + ++ ++ + + + ++ 
6 7102 67 8 qvcespqcssssacctetekqhgdagllngkaeslpgqplacnlcyeaes 72 7 



+++ + + ++++++ + +++.++++++ 

67102 72 8 pdeaalvyaarayqctlrsrtpeqvmvdf aalgpltfqllhilpf dsvrk 777 



+ + +++ ++ +++ ++ + + ++++ + ++ ++++ 

67102 77 8 rmsvvvrhplsnq^nrvytkgadsvimellsvaspdgaslekqqmivrekt 827 



++++ ++++ ++ ++++ + +++ ++++++++ + 
67102 828 qkhlddyakqglrtlciakkvmsdteyaewlrnhflaetsidnreellle 877 
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67102 



67102 



67102 



67102 



67102. 



....... . ld.evlglial . dklypgarealkaLkerGikvailTngdr . na 

\t + i.+ lg+ +d 1 +g++e ++aL+ + +Gik+'+ + lT+ + *++a 

878 samr 1 eNKlTLLGATG IeDRLQEGVPES I EALHKAG I KI WMLTGDKQeTA 927 
1 I • ■ t ■ t P ■* 

ealle . . .algla . lfdai\rdsdevggvgpVvvgKPkpeif llalerlgv 
+ ++ + + + ++ + + + +g ' . + i ++ + + + + + + 

92 8 VNI AYa ckLLE PDdKLF I LNTQS KDA- — - - -CfcMLMSTILKELQKKTQA 971 

kpeevg , . . . 

pe+y+ +++ ++++++++ + + ++ + +, + ++++++ + + + 
972 LPEQVSlsedllqppvprdsglragliitgktlefalqeslqkqflelts 1021 

• • p . kvlmvGDginDapalaaAGvgv 

, + + ++ ++++ + +++ +l++GDg nD+ ++ A++g+ 

.1022 wcqavyccratpl qksewklvrshlQvMTLAIGDGANJpVSMIQVADIGI 1071 

, atogngg*:-* * 
1*072 GVSGQE 1077 



DUF6: domain 1 of 1, from 1127 to 1271: score -24 .6, E ..- 9.4 

*->fiWalytvfskklle. . spl t f tawrf liagilllilllf lkkgppl 
+++ ++++ +++ ++++++ +1+ : + ++ + + + ++ 
67102 1127 LLFWYQI^FCGFSGTSmtDYWVIjIFFNLLFTS APPVI YGVLEKDV 1170 



67102 



.lallslkilallylgilgtalgyllyf 

+ ' ++ + ++ +++++++ + ++ 1 +y ^.+ + + + + + + y + y 

1171 saetlmqlpelyrsgqkseaylpHtFWITL-LDAFYQSLVCFFVPYFTYQ 1219 



67102 



yalkyvsaskasvlsslsPvf tlilsvllLgEkltlkqllGivlillGvl 
+ ++ + +++ +f+++l ++ +lt+++ll i+ ++1+ + 
1220 ■- -GSDTDIFAFGNPLNTAAIiFIVLLHLVIESKSLiTWIHLLVIIGSILSYF 1267 



67102 



lisl<-* 

1- + 

1268 LFAI 1271 
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♦USTAL W (1.74) multiple sequence alignment 
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useAT5A 



•h67102FL, 
useAT5A 



>h67102FL 
>useAT5A 



>h67102FL 
>useAT5A 



>h67102FL 
>USeAT5A 



;>h67102FL 
3UseAT5A 



Dh67102FL 
miseAT5A 



oh67102FL 
ouseAT5A 



bh67102FL 
ouseAT5A 



bh67102FL 
ouseATSA 



bh67102Fb 
.ouseATSA 



*bh67102FL 
K>useAT5A 



*bh67102FL 
>OUSeAT5A 



^bh67102FL 
<K>useAT5A 



-bh67102FL 



--- - MTEALQWARYHWRRLIRGATRDDDSGPYNYSSLI^ 

MERELPAAEESASSGWRRPRR- - RRWEGRTRTVRSNLLPPLGTEDSTIGAPKGERLLMRQ 
;*•*** * -* : . '*.**. * * . : 

T/^\ * 

HIOPFKDEYEKFSGAYVNNRIRTTKYTLLNFV PRNLFBQFHRjA ASLYFLFL 

CIQHLAD- - - - - ^LKTTKYTLLSFL pKNLFBQFHR^ 

^ * . * **..***** *.* im * . * .******** * * • * * * ' 

• Tr\-U 

EA FQKEll TM Li P LWVLT 1 1 A I KDGlfeD Y R K Y K I DKQ I NN LXTKVY S R KE KK Y*I PRC WKD V 
fNAFQPGIALAPVLFIIAVTAlj K^ 
. *** . . . . *-. **** *** ..i *..**.* *.**.****..* ** . . 

TVGDFI RLSCNEVI PADMVLLFSTDPDGI CH BE^cibGESWLKQRQVVRGYAEQDSEVD 
RVGDFVRLCCNEI I PAD I L»LIjS S SD PDGLOH 1 E*I ANyDGETNLKRRQVVRGFSELVSEFN 



******* m *** : **** . . ** .*.★***.*****. m **** : *** : ****** : - * ** m . 

PEKFSSR I ECE S PNNDLSRFRG FLEHSNKERVGLS KENLLLRGCTI RNTEAWG I WYAG 
PLTFTSVIECEKPNNDLSRFRGYIMHSNGEKAGLHKENLLLRGCTIRNTEAVAGIVIYAG 
* *.★ **** w ********** :: *** * - ^ * * ***************** m ***.** * 



KTGT LllEN KMVFRRC S V AG FD Y CH E EN ARR LE S YQ E 
KTGTLTte NKMVFRRCTVSGIEYSHDANAQRIARYQE 



H ETKAMLNNSG PR YKRS KLERRANTE^LWCVMLLV I MCLTGAVGHG I WljS R Y - E KMHFFN 
HKTKATJiNNSGPRYKRSOLEROMNCI JVLWCVLLIiVCISL 

*****•***********:***: * ******.*** c<> * m ***★★.*.** ** : * : 

__ T^y , 

vpepdghiisi ^S]agfymf wtmiil,lqvlipisly msibiv ki»gqiy^ 

VPESDGSSLSPATAAVYSF tFTMIIV^^ 

*** ** .** *..* * : -**** : ***************** : *:***:•*:::*:*: 

DS I VQCRALN I AEDLGQ I QYlP 
DSQLQCRALN I TEDLGQ I KY I 

* * i******* : * * .* * * * - * . V* * *************** : * : * : ; * . * : ** : ** * * * 

AVSEDEDFIDTVSGSLSNMAKPRAPSCRTVHNGPLGNKPSNHLAGSSFTL.GS-GEGASEV 
ADS EEEEWS KV - GTI S HRGSTG - - SHQS - 1 WMTHKTQS I KSHRRTGSRAEAKRAS 

* **.*.;...* *::* : - * ** .*. 

PHSRQAAFSS PI ETD WPDTRLLDKFSQ ITPRLFMPLDETI QNPPM ET^YIIDFFI 

MLSKHTAFSSPMEKDITPDPKLLEKVSECD-R-F^ 

* :: .*****:*„*: „**.:**:*.*: . ::* * :: **** 

" ALTICWlVVVTSPDQPRQKVRWFELiKSPVKTIED FLRRFTPSRLASGCS S I GNLST 

**.*******;;*:*****;* *:*::*: *:*..*:.: 

* * 

S - GVPN AFVS RLPLFS RMKP AS P VEEE VSQ VCE S PQCS S S S ACCTETEKQHGDAGLLNG K 
SKSSHKSGSAFliPSLSQDSMLLGLEEKIiGQT- - APSIASNGYASQAGQEESWASDCTT- - 

* . : : • ** i*: . :**::.*. :**:*.... ::: ' - 



AESLPG-Q- PIx^CN- LCYEAESPDEAALVYAARAYQCTI^SRTPEt^^ 



DQKCPGEQRECX^EGELRYEAESPDEAALVYAARAYNCAl.VDRLHDQVSVELPHIiGRLTFE 

. ** * * ******************-*;* m *- ;** *;;. ** ***; 

TZlHTQpFDSVRKRMSVXAniHPLSNQVVVY 

LLHTLGFDS I RKRMS W I RH PLTDEI NVYTKGADSWMDL.LL.PCS SDDARG - RHQKKIRS 

*** * A**;*******.****;;:; *********;*;** Z Z* I * ~ 

KTQKHLDDYAKQGLRTLCIAKKVMSDTEYAEWLRIWFIjAETSIDNREEIjIjLESAM^ 
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mseATSA 



>h67102FL 
>useAT5A 



>h67102FL 
mseAT5A 



>h67102FL 
juseATSA 



3h67102FL 
ouseATSA 



Dh67102FL 
>useAT5A 



3h67102FL 
DUseATBA 



Dh67102FL 
DUseATSA 



bh67102FL 
ouseATSA 



KTQNYI^LYAVE9LRTIX:iAKRVLSKEEYAC^ 
•***..*. * ★ -*********.*.* * * * * * : . * : **:*:*;..*****:;**;***■- 

. » » • 

LTLLGATG I EDRLQEGVPES I EALHKAG I KI WML.TGDKQETAVNI AYACKLLEPDDKLFI 
LHLI^ATGIEDRLQEGVPETIAKLRQAGU^IVA/LTGDKQETAINIAYACKliLDHGEEVlT 
+ ******************* * * r .**•********* : ********* : 

LNTQSKDACGMLMSTILKELQKKTQALPEQVS' LSEDLLQPPVPR- -DSGLRAGLIIT 

LNADSQEACAALLDQCLSYXQSRNPRSTLQNSESNLSVGFSFNPVSTSTDAS^SPSLVID 
.**..*..**..* J . *.;*.:. :.*■*■ **- *:. 

GKtLEFALQESLQKQFLELTSWCQAWCCRATPLQK^ 
i GRSLAYAIiEKSLEDKF^F]^KQCRSVLCCRSTPLQKS^^^ 

* :: * :**::**: *:. * :: * •***:****** *******;*;.*********** 

— — — pL*^^ 9 *?'** Tr*~/>~* — — — — J 



vswr 2VADIG IGVSGQEGMQAVMASDFAVSQFKHLS KLLLVHGHWCYTRLSNM I LYFFYK 
VS MIDVADVGVG I SGQEGMQAVMASDFAVPRFRYLERLLI VHGHWCYSRLANMVLYFFYK 
******** .*.*.***************.*. :* : ;*. ;**:*******:**.**;****** 

^_ ^7 I . 

NVAYVNLLFW YQFFCG FSGTSMTEfeWVL I FFNLLiFTS AP PV I YGVjyEKD VS AETLMQLPE 
NTM S VGLLFW FQ FY CG FS AS AM I DCJtfYL I F FNLLFS SL PQLVTGVf LDKDVPADMliliRE PQ 



* ****.******* .-* * * ********.* * :: ***.★**.*. *- 



LYRSGQKS EAYLPHTFW I TLLDAF YQ S L VCFFVP Y FTYQGS DTD, t FAFGNPLNTAALFI V 
LY KS GQNMEE YRPRAF WIjNMVDAA FQ S LVC F F I PYLAYYDSDVIWFTWGTPVTAIAXrFTF 
**„***. * * *::**:. f:** .****♦**.**; : * _**.*:*::*.*;.: * * * 



LZJHLVlteSKSL^ 

LLHIiG IE TKTWTV ^LNWIiACGFS TFLF FS VAL I YNt TSCATCY P P SN P YWTMQI^jLtGDMjFY 



* ** ******* ** 



** ** ** - * ; **; 

LVCILTTSIALLPRFVMRVLQGSLFPS PIL*R~ - - -AKHFDRLTPEERTK 

LTGLIAPIAAI^ 

* * . . . +****...**★****. * : : : * : * ::: 



bh67102FL 
ouseATSA 



bh67102FL 
ouseATSA 



ALKKWR G- - AGKMNQVTS KYAN - - QSAGK- SGRRP-MPG - PS AVFA - MKS 

TELSERKTMGPFETLPRDCASQASQFTQQLTCSPEASGEPSAVDTNMPLRENTLLEGIiGS 
; * * ■ '*.:..*.*•: : 1 ,:::*:-*- - ** . : : : : * 

ATS - CAI EQGNLS - LCET - ALDQG YS ETKAFEMAG - - - - PSKGKES - 

QASGSSMPRGAISEVCPGDSKRQSSSASQT^LSSLFHLPSFGSI^ISSLSIASGLGSV 

.* . * . * . * • .* . * ; : : . : : . ** * . • 



bh67102FIi 
ouseATSA 



XiQIjSG S SLQMD KQEXjEFLiS N P PQ PEQDLrH S FQGQVTG Y L 
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Input file Fbh44l81pat . seq; Output File Fbh44 181pat . tra 
Sequence length 7221 ' ' » • 

gcccx:gggatgggaacgcggcgcggggagtgaggcagtg^ 

gcccgctcccgcctctgtcttgtcggcctccacctgcagccccgcggcgcccgcgccccgcgggacccggacggcgacg 

M W R W I R Q O i L t G F D P. P H Q S 17 

ACGGGGGA ATG TGG CGC TGG ATC CGG CAG CAG CTG GGT TTT' GAC CCA CCA CAT CAG AGT 51 

» f ' . • • 

• D T R T I Y V A H R F P Q N G L Y T t P Q 37 

GAC ACA AGA ACC ATC TAC GTA GCC CAC AGG TTT CCT CAG AAT GGC/CTT TAC ACA CCT CAG 111 

K F I D N R I I ' S S K Y T , ' V W N F V P K 57 
AAA TTT ATA GAT AAC AGG ATC ATT TCA TCT AAG TAC ACT GTG TGG AAT TTT GTT CCA AAA » 171 

l . • •":*■'• 

N L F E Q * F R R V A N F Y F L I I F L V 77 

AAT TTA TTT GAA CAG TTC AGA AGA GTG GCA AAC TTT TAT TTT CTT ATT ATA TTT TTG GTT 231 

Q L M I P T P T S P V T S G L P L F F V 97 

CAG CTT ATG ATT GAT ACA CCT ACC AGT CCA GTT ACC AGT GGA CTT CCA TTA TTC TTT GTG 291 

I T V T A I K Q G Y E D W L R H* N S D N 117 

ATA ACA GTA ACT GCC ATA AAG CAG GGA TAT GAA GAT TGG TTA CGG CAT AAC TCA GAT AAT 3 51 

E V N G A P V Y V V R S G G 1 L V K T R S 137 

GAA GTA AAT GGA GCT CCT GTT TAT GTT GTT CGA AGT GGT GGC CTT GTA AAA ACT AGA TCA 411 

K N I R V G D I V R I A K D E.I. F P A D 157 

AAA AAC ATT CGG GTG GGT GAT ATT GTT CGA ATA GCC AAA GAT GAA ATT TTT CCT GCA GAC 471 

L V L L S S D R L P G S C H V T T A S L 177 

TTG GTG CTT CTG TCC TCA GAT CGA CTG GAT GGT TCC TGT CAC GTT ACA ACT GCT AGT TTG 531 

D G E T N L K T H V A V P E T A L L Q T 197 

GAC GGA GAA ACT AAC CTG AAG ACA CAT GTG GCA GTT CCA GAA ACA GCA TTA TTA CAA ACA 591 

V A 1 N L D T L V A V I E C Q Q P E A D h 217 
GTT GCC AAT TTG GAC ACT CTA GTA GCT GTA ATA GAA TGC CAG CAA CCA GAA GCA GAC TTA 651 

V R F M G R M I I T Q Q M E E I V R P L 237 
TAC AGA TTC ATG GGA CGA ATG ATC ATA ACC CAA CAA ATG GAA GAA ATT GTA AGA CCT CTG 711 

G p E „S L L L R G A R L K N T K E I F G 257 

GGG CCG GAG AGT CTC CTG CTT CGT GGA GCC ' AGA TTA AAA AAC ACA AAA GAA ATT TTT GGT 771 

V A V Y T G METKMALNYKSK S.Q 277 
GTT GCG GTA TAC ACT GGA ATG GAA ACT AAG ATG GCA TTA AAT TAC AAG AGC AAA TCA CAG 831 

K R S A V E K S M N T F L I I Y L V I L 297 

AAA CGA TCT GCA GTA GAA AAG TCA ATG AAT ACA TTT TTG ATA ATT TAT CTA GTA ATT CTT 891 

I S E A V I S T I L K Y T W Q A E E K W 317 

ATA TCT GAA GCT GTC ATC AGC ACT ATC TTG AAG TAT ACA TGG CAA GCT GAA GAA AAA TGG 951 

D E P W Y N Q K T E H Q R N S S K I L R 337 

GAT GAA CCT TGG TAT AAC CAA AAA ACA GAA CAT CAA AGA AAT AGC AGT AAG ATT CTG AGA 1011 
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F I S i D F L A F L V L Y N ' F I I P I S L 357 

TTT ATT TCA GAC TTC CTT GCT TTT TTG GTT CTC TAC AAT. TTC ATC ATT CCA ATT TCA TTA 1071 

Y V T V E M 0 K F L G S F F I G W D L D 377 
TAT GTG ACA GTC GAA ATG CAG AAA TTT CTT GGA TCA TTT TTT ATT GGC TGG GAT CTT GAT 1131 

L Y H E E S D v O K A Q V N T S ' D L N E . E 397 

CTG TAT CAT. GAA GAA TCA GAT CAG AAA GCT CAA GTp AAT ACT i TCC GAT CTG AAT GAA GAG 1191 

L G Q V E Y V F -T D K T G T L T E N . E M 417 

CTT GGA CAG GTA GAG TAC GTG TTTj ACA GAT AAA ACT GGT ACA CTG ACA GAA AAT GAG ATG 1251 

Q IF R E C S I N G M K Y Q E I N G R L V | 437 

CAG TTtf CGG GAA TGT TCA ATT AAT GGC ATG AAA TAC CAA GAA ATT AAT GGT AGA CTT GTA 1311 

P E G P T PUSS E G.N L S Y L S S L S 457 

CCC,G^A GGA CCA ACA CCA GAC TCT TCA GAA GGA AAC TTA TCT TAT CTT AGT AGT TTA TCC 1371 

H L N N L S H L» T T S S S F R T S P E N 477 

CAT CTT AAC AAC TTA TCC CAT CTT ACA ACC AGT TCC TCT TTC AGA ACC AGT CCT GAA AAT 1431 

E T E L I K E H D L F F K A V S L C H T 497 

GAA ACT GAA CTA ATT AAA GAA CAT GAT CTC TTC TTT AAA GCA GTC AGT CTC TGT CAC ACT 1491 

V Q I S N V Q T D C T G D G P W Q S N L 517 
GTA CAG ATT AGC AAT GTT CAA ACT GAC TGC ACT GGT GAT GGT CCC TGG CAA TCC AAC CTG 1551 

A P S Q L E Y Y A S S P D E K A L V E A 537 

GCA CCA TCG CAG TTG GAG TAC TAT GCA TCT TCA CCA GAT GAA AAG GCT CTA GTA GAA GCT 1611 

A A R I G I V.F I G N S E E T M E V K T 557 

GCT GCA AGG ATT GGT ATT GTG TTT ATT GGC AAT TCT GAA GAA ACT ATG GAG GTT AAA ACT 1671 

L G K L E R Y K L L H I L E F D S D R R 577 

CTT GGA AAA CTG GAA CGG TAC AAA CTG CTT CAT ATT CTG GAA TTT GAT TCA GAT CGT AGG 1731 

i 

r M S V I V Q A P S G E K L L F A K G A 597 

AGA ATG AGT GTA ATT GTT CAG GCA CCT TCA GGT GAG AAG TTA TTA TTT GCT AAA GGA GCT 1791 

E S S I L P K C I G G E I E K T R I H V 617 

GAG TCA TCA ATT CTC CCT AAA TGT ATA GGT GGA GAA ATA GAA AAA ACC AGA ATT CAT GTA 1851 

D E F A L K G L R T'L C I A Y R K F T S 637 

GAT GAA TTT GCT TTG AAA GGG CTA AGA ACT CTG TGT ATA GCA TAT AGA AAA TTT ACA TCA 1911 

KEY E E I D K R I F E A R T A L Q Q R 657 

AAA GAG TAT GAG GAA ATA GAT AAA CGC ATA TTT GAA GCC AGG ACT GCC TTG CAG CAG CGG 1971 

E E K L A A V F Q F I EKD LI LLGA 677 

GAA GAG AAA TTG GCA GCT GTT TTC CAG TTC ATA GAG AAA GAC CTG ATA TTA CTT GGA GCC 2031 

T AVEDRLQ DKVRETIEALRM 697 

ACA GCA GTA GAA GAC AGA CTA CAA GAT AAA GTT CGA GAA ACT ATT GAA GCA TTG AGA ATG 2091 

A G I K V W V L T G D K H E T A V S V S 717 

GCT GGT ATC AAA GTA TGG GTA CTT ACT GGG GAT AAA CAT GAA ACA GCT GTT AGT GTG AGT 2151 
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C S C G H F* H R T M N I L ' E L I N Q K S 737 

TTA TCA TGT GGC CAT TTT CAT AGA ACC ATG AAC ATC CTT GAA CTT ATA AAC CAG AAA TCA 2211 

D S E C A E Q L» R Q L A R R I T E D H V 757 

GAC AGC GAG TGT GCT GAA CAA TTG AGG CAG CTT GCC AGA AGA ATT ACA GAG GAT CAT GTG 2271 

I Q H G L V V D G T S L S L A L R E , H E 777 

ATT CAG CAT GGG CTG GTA GTG GAT GGG ACC AGC CTA TCT CTT GCA CTC AGG GAG CAT, GAA 23 31 

K L F M E V C R'NC S A V L C C R M A p' 797 

AAA CTA TTT ATG GAA G'lT TGC AGA AAT TGT TCA GCT GTA TTA TGC TGT CGT ATG GCT CCA 23 91 

U Q K A K V il R h I K I S P E K P I T L 817 

CTG CAG AAA GCA AAA GTA ATA AGA CTA ATA AAA ATA TCA. CCT GAG AAA CCT ATA ACA TTG 24 51 

A V G d' G A N D V S M I Q E A H V G I G 837 

GCT GTT GGT GAT GGT GCT AAT GAC GTA AGC ATG ATA GAA GAA GCC CAT GTT GGC ATA GGA 2511 

■ l 

. z M G K E G R Q A A R ' • N S D Y A I A R F 857 

ATC ATG GGT AAA GAA GGA AGA CAG GCT GCA AGA AAC AGT GAC TAT GCA ATA GCC AGA TTT 2571 

K F L S K L L F V H G H F Y Y 1 ± R (T A T 877 

AAG TTC CTC TCC AAA 1 TTG CTT TTT GTT CAT GGT CAT TTT TAT TAT ATT AGA ATA GCT ACC 2631 

L v Q Y F F Y K N v'C F I T P Q F L Y Q 897 

CTT GTA CAG TAT TTT TTT TAT AAG AAT GTG TGC TTT ATC ACA CCC CAG TTT TTA TAT CAG 2691 

F y C L» F S Q - Q T L Y D S V Y L T L Y N 917 

TTC TAC TGT TTG TTT TCT CAG CAA ACA TTG TAT GAC AGC GTG TAC CTG ACT TTA TAC AAT 2751 

XCFTS LPILIYSL L EQ HV DP 937 

ATT TGT TTT ACT TCC CTA CCT ATT CTG ATA TAT AGT CTT TTG GAA CAG CAT GTA GAC CCT 2811 

HVLQNKP TLY RDISK NRLLS 957 

CAT GTG TTA CAA AAT AAG CCC ACC CTT TAT CGA GAC ATT AGT AAA AAC CGC CTC TTA AGT 2871 

T K T * F L Y W T I L G F S H A F I F F F 977 

ATT AAA ACA TTT CTT TAT TGG ACC ATC CTG GGC TTC AGT CAT GCG TTT ATT TTC TTT TTT 2931 

G s Y U L I G K D T S L L - G N G Q M F G 997 

GGA TCC TAT TTA CTA ATA GGG AAA GAT ACA TCT CTG CTT GGA AAT GGC CAG ATG TTT GGA 2991 

N W T F G T L V F T , V . M V I T V T V K M 1017 

AAC TGG ACA TTT GGC ACT TTG GTC TTC ACA GTC ATG GTT ATT ACA GTC ACA GTA AAG ATG 3051 

A L E T H F W T W I N H L- V T W G S I I 1037 

GCT CTG GAA ACT CAT TTT TGG ACT TGG ATC AAC CAT CTC GTT ACC TGG GGA TCT ATT ATA 3111 

F Y F V F S L F Y G G I L W P F L G S Q 1057 

TTT TAT TTT GTA TTT TCC TTG TTT TAT GGA GGG ATT CTC TGG CCA TTT TTG GGC TCC CAG 3171 

N M y F V F I Q L h S S G S A W F A I I 1077 

AAT ATG TAT TTT GTG TTT ATT CAG CTC CTG TCA AGT GGT TCT GCT TGG TTT GCC ATA ATC 3231 

I* M V V T C L F L D I I KK V FD R K L 1097 

CTC ATG GTT GTT ACA TGT CTA TTT CTT GAT ATC ATA AAG AAG GTC TTT GAC CGA CAC CTC 3291 
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H P T S T E K A Q L T E T N A G T K C L 1117 

CAC CCT AC A AGT ACT GAA AAta GCA'CAG CTT ACT GAA AC A AAT GCA GGT ATO* AAG TGC TTG 3351 

> i 

D S M C C F P E G E A A C A, S V G R M L 1137 

GAC TCC ATG TGC TGT TTC CCG GAA GGA GAA GCA GCG TGT GCA TCT GTT GGA. AGA ATG CTG 3411 

E R V I G R C S P T H I S S S W S A S D 1157 

GAA CGA GTT ATA GGA AGA TGT AGT CCA ACC CAC ATC AGC AGT TCA TGG AGT GCA, TCG GAT 3471 

v i 

P F Y T N D R S I I L 1 T L d T M D ' S S T 1 Ci . 1177 

CCT TTC TAT ACC AAC GAC AGG AGC ATC TTG ACT CTC TCC AC A ATG GAC TCA TCT ACT TGtf 3531 

i . 1178 

TAA , ' 3534 

AGGGGCAGTAGTAC7TTGTGGGAGCCAGTTCACCTCCTTTC 

AGCTCTGAAATTAATTTCCAAAATCTTTGTAGTAGTTCATACCCACT^ 

TAGTACAAGCCCCTCCCAACACCCTTAATTTG 

TTTGTCTCXmTGTCCCTTGTGCTTAT^ . 

i ' 
i 

AAATGTAGAAAAAAGAGAGAAATCTTAGTAAAGAGTATTTTTTAGTATTAG 
TGCTTCTGTAAATTATGCTGAAAGTTTGCCTrTGAG 

AAAAGTTAATGTGAATACTGAGGAATTTTGGTCCCTCAGTGACCTGTGTTC 
AGAGCAAATTAGGAGAATCATTTCCAACCATTATTTACTGCAGTATGGGGAGTAAA 

ACTGTAACACAGCCTGTAAAGTTAGCCATATAAATGCAAGGGTATATCATATATACAAATCAGGAATCAGGTCCGTTCA 

CCGAACTTCAAATTGATGTTTACTAATATTTTTGTGACAGAGTATAAAGA^ 

GCATATTATTAATTTAATGTCTTTATCATTC^ 

TTCTCTTTACCTGAAGGCTCTGTGTATAGTATTTCATGACATCGTTC 
GTATTTT^AATATTGCAAATATGTTTAATTATACAAATCAGAATAGTATC 
TClfTTCTGCAGCCGACTTAGACATGCTCTTCC 
TATTTTCAGGTTATGTCATCTAACTTATAGCAAACTACCACAA 

ATTTCAGGTGTGGCTGTGGAATGTAAAAATGCTCAACTTGTATCAGGTAATGTTAGCAATA 

ATTAATCGGGTACATGTTACTGTAATTAACTCATTGCAC^ 

AGTATTGTCATTTGTTTTTGTTTTATTGAAAAGTAATGTTG 

TACCCAGCTCTAAGCAAATAATGATTGTATACATAT^^ 

AATGTAATTCCTTTATGGAGATTTATTGTGCAGCCCT 

GACTGGCAGGGGAAAGAATGGTAGAGACAGAAATTAAGACTTTATCC 

ATGTAACATTTGTCTGTTCCAGTGATGTAAGGATATTAAGTTATTAAGCT 
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TTAACTTAGATA*rrT^ATAGCTGGATTT 

GTCTGCATTCACTAATTCATGTTCCAGAAGAGGAAATAATGA^ 

AAATTTGCTCATAAAATCTCTTATAAAACGTGCATATAACAAAATGACACCCAGTA 
CGTGTTTATTTGCCATCAAATAAACTGAGTACTGACACCAGAC^ 

TGCAGCAAGACAGGAGGtCAGCTCGCCTATAATGGTGCTTAAAGTGTGATTGATGTAATTT^ 

AGTTAGTTAAGGAGAACTTTATTTTTTTAAAAAAAGTAAATGGCAACCACTAGTGTG 

AATCCACTCCGTTTTTAAAGCAAAATTATCTTGTGATTTTAAGA 

• . . . . » 

CAGTCTdCAAGCTrrcAGTAGTTTTC 

AAAGTGTCCCCTTTGCATATTTCl^AAAATTCTrrc 

CAAAACCAGAGCAAAATGCTAAATACGTTATTCCTAATCAGTGGTCTCA 

GGGCTGTAAGCCTGAAGATAGTGGCAAGCACCAAGTCAGTTTCCAAAATTGCCCCTC 

CCCTGCCTCAGCTTCAGCAGGCGTAGGCTCACGCTGGGCGGAGCAAAGTATGrGGCCAGGGAGAACTACAGCTACGAAGA 
CCTGCTGTCGAGTTGAGAAAAGGGGAGAATTTATGGTCTGAATTTTCTAACTG^ 

AATACAGAAAGGCTTCCAGACCTGAGCCACACCCAGGCCCTATCCTGAACAGGAGACTAAACAGAGGCAAATCAACCCT 
AGGAAATACTTGCATTCTGCCCTACGGTTAGTACCAGGAC 

TATCTGATCGCTTGAGACTCCTAATAGGCAGGAGTCAJ\GGCCACTAGAAAATTGACAGTTAAG 
AATATGCTACTCTGAAAAATCTCGTGAAGGCTGTAGGAAAAGGGAGAATCTTC 
CAGTTTGGGGTATGATATAAGCAGGTA1TAATAAAAATAACACACCAAAGAGTTACGTAAAA 
GGTCCCdACGTAQAGACATTTTATTTCTATTTTGAAATGAGTTATCT 

TGTTTTATGTGAAATAACTTGAATGTTGTTCCTATAA 

TTAGATTTTTATGAGGAATGAGTATCTGGA^^ 

CTGTCrAATGTAATCCTTTAAAAATTCTCTGCATTC 

TAAAGTTTATGAAGTTATATTTATCAAATAAAAACTTTCCTATAT 
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Protein Family / Domain Matches, HMMer version 2 • 

Searching for complete dswnai^s in PFAM ...» , 
hmmpfam - search a single seq against HMM database 
HMMER 2.1.1 (Dec 1998) 

Copyright (C) 1992-1998 Washington University School of Medicine » r 
HMMER is freely distributed under the 'GNU General Public License (GPL) 

HMM file: /prod/ddm/seqana'l/PFAM/pfam6.4/Pfam 
Sequence file: /prod/ddm/wepace/orf anal /oa-script . 15759 . seq 

Query: 44181 i 

Scores for sequence family ,classif idation (score includes all domains): 
Model Description , Score E-value N 



0.13 



Hydrolase haloacid dehalogenase-like hydrolase 42.8 8e^-09 

1 1 • ' 

El-E2_ATPase E1-E2' ATPase ' 8 , 6 
1 ' 
DUF13 2 t Protein of unknown function DUF13 2 -72.9 9.4 

1 " 

Parsed for domains: I 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 



El-E2_ATPase l/l 126, 1 1*64 . . 37 75 8.6 0.13 

DUF132 1/1 579 719 .. 1. 160 [J • -72. ^ " 9,4 I 

Hydrolase ■ l/l 401 842 . 1 184 (1 42.8 8e-09 

Alignments of top- scoring domains: ' 

El-E2_ - ATPase: domain 1 of 1, from 126 to. 164: score 8.6,. E m 0.13 
* - >VlRdGkeeeipaeeLvpGDiVevkpGdrVPADgrwege< - * 
V+R G + + + + + + +++GDiV+++ ++ PAD+++++++ 
44181 126 NAmSGGLWTOSKNIRVGDIWIAKDElFPMLVLLSSD 164 

DUF132: domain 1 of 1, from 579 to 719: score -72.9/ E = 9.4 

*->MeeiklkWIDTsVliaA. . . . . LispkGlaf kllelLf eeKleN. . 
M V++ A+++++L+ kG +1 +++ +e ++ 

44181 579 MS VIVQApsgekLLFAKGAESSILPKCIGGEIEKtr 614 

.YtSdeiLeEyifkillpKLekklpvEvslkkvl.wlvskSkvinPRSF 
+ + + L+ 4- + 1 . + +k+ + E ++1++ ++ + + 
. 44181 615 iHVDBFALKGL- -RTLCIAYRKFTSKE- - YEEIDkRIFEARTALQQR-- - 657 

KESntkFnvcRDpeDNKFLn . . . wYesKAdyl ITyDkDLLdRLRDENkk 
+k + F++++ + + A +L d+ R 

44181 658 EEKLAAV FQFIEkdl ILLGATA- VEDRLQDKVRETIEA 694 

lkledHefkvLTPkEFiesveiqcls<-* 
1 + + +++ + vLT v +ls 

44181 695 LRMAG I KVWVLTGD KHETA VS VS LS 719 

Hydrolase: domain 1 of 1, from 401 to 84 2: score 42.8 r E « 8e-09 

*->ikawFDkDGTLtdgk. . . 

+ v+ Dk+GTLt+ + + + + + ++ + ++ 4++ +++ +++++++ 
44181 401 VEYVFTDKTGTLTENEmqfrecsingmkyqeingrlvpegptpdsse 447 



+ 4- + 44- 4-4- 4-+ +4- + + + + + +++++++++ + + + + 4- 4- + + 

44181 448 gnlsylsslshlnnlshlttsssfrtspenetelikehdlf fkavslcht 497 



eppiaeaivealrelgl .... 

+ +4- + + + 4-4+4-+ +++ +++ + + p ++ a+vea++++g+ + 
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44181 498 VqisnvqtdctgdgpwqsnlapsqleyyaSSPDEKALVEAAARIGIvf ig 547 

* '■'.»' * i 1 ■ ' 

apleevekllgrgl .g. . . erilleggl taell . . . . . . . 

+++e +e ++ + 1 + +.+ . + + H + + ' + + , ++ + + + :+ + 

44181 548 NSEETMEVKTLGKLeRyklLHILEFDiSDRRRMSvivqapsgekllf akga 597 



+++ ++ +++ +++^ + ++ ++ ++ ++++++++ +++ 

44181. .598 essUpkciggeiektrihvdefalkglrticiayrkitskeyee.idkri 647 



1 



1 



ld.evlglial - dklypgarealkaLke . 

+ ++ ++++ + d +lg+ a++d 1 + +re+++aL+ ■ 

44181 648 feartal^qreeklaavfqf ieKDl ILLGATAVeDRLQDKVRETIEALRM 697 

rGikvailTngdr . naeallealgla 1 If daivdsdevggvgpwvgKPk 
+Gikv++lT-*i+ +++a+ + +++ + ++ + + K + 

44181 - 698 AGIKVWVLTGDKHeTAVS T - VSlSCGHFHRTMNIL- - -ELINQKSD 738 

peif llalerlgvkpeevg ............................... 

e+++++++ ++ e ++ +++ ++++ + ++ + + + + ++ 
44181 739 SEC^QLRQLARR1TE--Dhviqhglvvdgtsislalreheklfmevcrn 786 

• - v ......... .p . kvlmvGDginDapalaaAGvgv 

+ ...+ -+ + + +.. + + +++ +++l+vGDg nD+ ++ A+vg+ 
44181 787 csavlcc rmap 1 qkakv i r 1 ik i spe Kp ITLAVGDGANDVSMI QEAHVG I 836 

1 .' . 

amgngg<-* 

. + 

44181 837 GIMGKE 842 
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•oSTAL W (1 



.74) multiple sequence alignment 



>h67076FL 

•useATlH 

>h44181 



■h67076FL 

useATlH 

•h44181 



- - MFRRS LNRFCAGEEKRVGTRTV FVGN - H P VS ETEAY I AQRFCDNR I VS S K YTL WN F 
MDCSLLRTLVRRYCAGEENVTVDSRTI YYGHKEPPPGAEAY I PQRYPDNRI VSSKYTFWNF' 

MWRW I RQQLG FD PPHQSDTRT I YVAH - RF PQNGLYTPQKF I DNR 1 1 S S KYTVWNF 

: * z . * .*::****:******** 



LjPKNLFEQFRR I AN FY Flil I FLVQVTVDtrPTS P VT^GLPLFFVI TVTA I l4QGYEDWLRHR 
I PKNLFEQFRRI ANFYFL 1 1 FLVpLI I D [TPTS PVTSGLPLFFVI TVTA I I^QG YEP W.LRHK 
V&KNLFEQFRRVANFVFLC I FL VO LiM I DTPTS jP VTS GL PL FFVI frVTA fl KQG YEDWLRHN 
.********** - 44**4*444444 - -********** *4****** ************** 



>h67076FI J 

•useATlH 

>h44181 



>h67076FL 

mseATlH 

>h44181 



>h67076FL 

mseATlH 

3h44181 



>h67076FL 

>useATlH 

>h44181 



3h67076FL 

DUseATlH 

DK44181 



Z)h67076FL 

ouseATlH 

oh44181 



oh6707€FL 

oueeATlH 

bh44181 



bh67076FL 

DUseATlH 

bh44181 



bh67076FL 

ouseATlH 

bh44181 



*>h67076FL 
ouseATlH 



ADNEVNKSTVYI IENAKRVRKESEKI KVGDWEVQADETFPCDLI LLSSCTTDGTCYVTpM 
ADNAMNQCPVHFIQHGKIWRKQSRKLR 

SDNEWGAPVYWRSGGLVKTRSKNIRVGDIVRIA10DEIFPADLVLLSSDRLDGSCHVl|lJ 



, * * ; * 



* * 4 4 **«.*** * * . 4 



AS 

A 

A: 
** 




lESNCKTHYAVRDTIALiCTAESIDTLRAAIECEQPQPDLYKFVGRINIYSNSLEAV 
iESSHKTHYAVQDTKGFHTEADVDSLHATIECEQPQPDLYKFVGRINVYNDLNDPV 
iETNLKTHVAVPETALI^TVANLDTLVAV I ECQQ PEADLYRFMGRM - 1 1 TQQMEE I 
*** ; *** ** : * : * .:*•* *.***:**:.***:*:**: : . : : : 



ARSLGPENLLLKGATLKNTEKI YGVAVYTGMETKMALNYCX5KSQKRSAV^KS I NA^LIVY 
VRPLGSENLL.LRGATLKNTEKI FGVAIYTGMETKMALNYQSKSQKRSAVEKSMNTF LIVY 
VR PLG PES LLLRGARLKNTKE I FGVAVYTGMETKMALNYKS KSQKRSAVEKSNINTl{liIIY 
* ** 4 * # 4* * • ** **** . - * .'* ** • ************ ************* : *** : * 



LFILLTKAAVCTTLKYVVtoSTPYlTOEPWYNQKTQKERETLKVLKMFTDF LSFMVLFNFII 
LCILVSKALINT^KYVvfosEPFRDEPWYNEKTESERQRNLFLRAFTDF LAFMVLFNYI I 
LVILISEAVISTILKYTWp AEEkWDE PWYNQKTEHQRN SS KILRFI SD FlLAFLVIiYNFI I 

44 ... * I"""* * ** " * * - ******:**. ;*; .*; ..***.*;**.*;** 

(*K^r v ^ lv f %Jl ^^rtv ^k^^wk 



PVSMYVTVEfrfeKFLGS FFI SWDKDFYDEEINEGALVNTSDLNEELGQVDYVgg^KTGTLT 
pVSMYVTVEMpKFLGSYFITWDEDMFDEEMGEGPLVOT 
PISLYVTVE^KFLGS FFIGV/DLDL^ 

4.*. ************ *** ** *--. ** • *** ********** ;* : ***4*4 *44 



5 



ENSMEFI ECCIDGHKYKG VTQE VDGLS QTDGTLT YFDKVD — 

ENNMAFKECCI EGHVYVPH - - -VICNGQVLPDSSG- IDM3DSSPGVC _ ^ 

ENEMQFRECSINGMKYQEINGRLVPEGPTPDSSEGNLSYLSSLSH^ 
4* m + * ** # * : * * : ; : :.. 

KNREELFLRALCLCHTVEI KTN DAVDG ATESAELTYI SSSPDEI A 

_ _ _ - - - - GREREELiFFRAI CLCHTVQVKDDHCGDDVDGPQK- S PDAKSCVYI SS S PDEVA 
S PENETEL I KEHDLFFKAVSLCHTVQI SNVQTDCTGDGPWQSNIiAPSQLE YYASS PDEKA 

** *.****** 



.**••*. ***44 



LVKGAKRYGFTFLGNRNGYMRVENQRKEI EE YELLHTLNFDAVRRRMS VI VKTQEGD 1 1A> 
L\n2GVQRLGFTYljRLKDNYWEIljNRE 

LVEAAARI G I VF I GNSEETMEVKTLG - KLERYKLLH I LBFDSDRRRMSVI VQAPSGEKIiL 
**:..**:.:: : * . : . .:*.::**• * ** : ********:: * : * 

. FCKGADSAVFPRVQNHEIELTK\mVERNAMTC^ 
FCKGADSS I F PRV I EGKVDQVRSRVERNAVEGLRTLCVAYIOILEPEQYEDACRIjI^S AKV 
FAKG AE SSI IiPKC I GGE I E KTR I HVDEFAIiKGLRTLC I AYRKFTS KEYEE I DKR I FEART 
*.***:*:;:*: ::: .: :*:. *:.* ****:*::.: ..:** : : • * : 

ALQDREEKMEKVFDDI ETNMNLIGATAVEDKLQDQAAETI EALHAAGLKVWVLTGDKMET 
ALQDREKKLAEAYEQI EKDLiVLLGATAVEDRIjQEKAADTI EALQKAG I KVWVLTGDKMET 
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»h44181 



ALQQREEKLAAy FQF I E KDL I LLGATA VSDRLQD KVRET I EALRpMAG I tfVWVLTGDKHET 

a a a . a . . . a a a * • • ^ »* #» . 



>h67076FL AKSTCYACRLFQTNTELLELTTKTI EESERKEDRLHELI/IEYRKKLLHEFPKSTR - SFKK 

>UseATlH ASATCYACKLFRRSTQLLELTTKKLEEQS LHDVLFDLSKTVLRCSGSMTRDSFSG 

>h44181 - A VS VS LiS CGHFHRTMN I LEL I NQKS DS EC A - EQLRQLARR I T EDHVI 



»h67076FL 

>useAT!H 

>h44181 



>h€7076FL 

>USeATlH 

>h44181 



>h67076FL 

mseATlH 

>h44181 



>h67076FL 

mseATlH 

:>h44181 



?h€707€FL 

DUseATlH 

:>h44181 



:>h67076FL 

>useATlH 

Dh44181 



AWTEHQE YGLI I DGSTLS L»NS SQD - S ? SNNYKS I FLQ I CMKCTAVLCCRM^PLQKAQ I 
LSTDMHDYGLIIDGAALSLIMKPREDGSSSG^rYRELFLEICRNCSAVLCCR^lAPI^QKAQI 
QHGXjWDGTSLSLALREHEK LFMEVCRNCSAVLCCRMAPLQKAKV 

• •..**.-**..*** . . . 4 . - . * .*»** + + ****•*****.. 

a a mm m a mm 'mm • • • • • • a a 

VRMVKNLKGS P I Tli S(^DGANDV^lt,ESHVG I G I KGKEGRQAARNSDYSVPKFKHLKKL, 
VKLI RFS KEHP I TLA I 3DGANDVS M I OEAHVGIGVIGKEGRQAARNSDYA IPKFKHLKKM 

I RLI K I S PEKP I TLiA^pDGANDV^M^EAHVGI G I MGKEGRQAARNSDYA I ARFKFLS KL» 
.*...* ****-..****- *•* * * *■* *-**■***. .** * *" . 

• a a a • «• a a a • * • I • . 



LLAHGHLYYVR I AH LVQYFFYKNLCFI LPQFLYQF FCQFSQQPLYE^^YLTMYNICFTSL 
LLVHGHFYYIRISE LVQYFFYKNiVCFIFPQFLiYQF FCGFSQQTLYDl^YLTLYNISFTSLi 
LFVHGHFYYIRIAT tbyQYFFYKNVCFITPQFL^ 

*. ***.**.**•! *********.*** *******.* **** ***.*** **** 

7^7 



PILAYSLL 2QH INI DTLTSDPRLYMKI SGNAMLQLGP FLYWTFLAAFEGTVFFFGTYFL 
PILLYSLM 2QHVG I DVLKJU^PTL YRD I AKNALLRWRV F I YWTFLG VFDALVF F FGA Y F 1 1 
PILIYSLLP QHVDPHVIjQNKPTLYRDIS KNRLLS I ICI JFLYWTILGFSHAFIFFFGSYLIji 



-QTASLEENGK^/YGNWTFGTI^ 
-ENTTWINGQMEfit^ 
GKX^TSLLGNG C^IFGNWTFGTLVFTVIW 



FFWG3IIWPFLKCX2RMYFVFA^LSSVSTWI^ 

LLWC-fcviWPFLSYQRMYYVF^g^LSSGP 
LF^GILWPFLGSQl^YF fiTF^ 



Dh67076FL - - KNVRRRS AR RNLSCRRASDS- -LSAR PSVRPLLLRTFSDESNVL 

DUseATlH RTQNIQHQDSISEFTPLASLPSWGAQGSRLLAAQCSSPSGRVVCSRWESEECPVIjPLHPG 
oh44181 KAQLTETNAGIKCLDSMCCFPEGEAACAS-VGRMLERVIGRCSPTHISSSWSASDPFYTN 



bh6707 6FL 

DUseATlH 

oh44181 



LPHKARYGCCRSSLEMPT 
DRSILTLSTMDSSTC- - - 
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Input file Fbh67084FL.seq; Output File Fbh67084FL. tra 
Sequence length . 4198 • , , t 

GGAGTCGACCCACGCGTCCGCATTGAGACAATGCCTCCACAAATACTTGAT^ 

AATCACCATTATAGTTTCTGACAAATTGTTCTCAAAAAGG 

M P L M M S . E . E G f'e 'N E E S ; D Y H T L 20 

ATG CCA CTA ATG ATG TCT GAA ? GAA GGC TTT! GAG AAT GAG GAA ,AGT GAT TAC CAC ACC TTA 60 

P R A R I . M Q R K R G L E W F V C D .G W 40 

CCA CGA GCC AGG ATA ATG CAA AGG AAA AGA GGA CTG GAG TGG TTT GTC TGT GAT GGC TGG 120 

K ( F L C T S C C G W L I N I C R R K K E , 60 

AAG TTC CTC TGT ACC AGT TGC TGT GGT TGG CTG ATA AAT ATT TGT CGA AGA AAG AAA GAG 180 

L K A R T V W L G C P E K C E E KH P R 80 

CTG AAA GCT CGC AC A GTA TGG CTT GGA TGT. CCT GAA AAG TGT GAA GAA AAA CAT CCC AGG 24 0 

i • 

N S I K N Q K Y N V F T F ' I P G V L Y E 100 

AAT TCT ATA AAA AAT CAA AAA TAC AAT GTG TTT ACC TTT ATA CCT GGG GTT TTG TAT GAA 300 

• Q ■ F K F F L N L Y F L V I S C S Q F V P 120 

CAA TTC AAG TTT TTC TTG AAT CTC TAT TTT CTA GTG ATA TCC TGC TCA CAG TTT GTA CCA 360 

i 

A L K I G Y L Y T Y W A P L G F V L A V 14 0 

G£A TTG AAA ATA GGC TAT CTC TAC ACC TAC TGG GCT CCT CTG GGA TTT GTC TTG GCT GTT 420 

T M T R E A I D E F R R F Q R D K E V N 160 

ACT ATG ACA CGG GAA GCA ATT GAT GAA TTT CGG CGT TTT CAG CGT GAC AAG GAA GTG AAT 480 

SQLYS K LTVRGKVQVKSS DI 180 

TCA CAA CTA TAT AGC AAG CTT ACA GTA AGA GGT AAA GTG CAA GTT AAG AGT TCA GAC ATA 540 

Q V G D L I I V E K N Q R I P S D M V F 200 

CAA GTT GGA GAC CTC ATC ATA GTG GAA AAG AAT CAA AGA ATT CCA TCG GAC ATG GTG TTT 600 

L R T S E K A G S C F I R T D Q L D G E 220 

CTT AGG ACT TCA GAA AAA GCA GGT TCG TGT TTT ATT CGA ACT GAT CAA CTA GAT GGT GAA 660 

T D W K L K V A V S C T Q Q L P A L G D 240 

ACT GAC TGG AAG CTG AAG GTG GCA GTG AGC TGC ACG CAA CAG CTG CCG GCT CTG GGG GAC 720 

L F S " I S A Y V Y A -Q K P Q M D I H S F 260 

CTT TTT TCT ATC AGT GCT TAT GTT TAT GCT CAG AAA CCA CAA ATG GAC ATT CAC AGT TTC 780 

E G T F T R E D S D P P I H E S L S I E 280 

GAA GGC ACA TTT ACC AGG GAA GAC AGT GAC CCG CCC ATT CAT GAA AGT CTC AGC ATA GAA 840 

NT L W A S T IVASGTVIGVV I Y 300 

AAT ACA TTG TGG GCA AGC ACC ATT GTT GCA TCA GGT ACT GTA ATA GGT GTT GTC ATT TAT 900 

i 

T GKETR SVMNTSN PKNKVGL 320 

ACC GGA AAA GAG ACT CGA AGT GTA ATG AAC ACA TCC AAT CCA AAA AAT AAG GTT GGT TTG 960 

L> D L» EL N R L T K A L F LALVA L S 340 

TTG GAC CTT GAA CTC AAT CGG CTG ACG AAA GCG CTA TTT TTG GCT TTA GTT GCT CTT TCC 1020 
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I . V M V T L Q G F V G P W. ' Y R N L F R F' 360 

ATT GTT ATG GTA ACC TTA CAA GGA TTT GTG GGT CGA TGG ■ TAC , CGC AAT CTT TTT CGG TTC 1080 

L L L F S Y I I P I S L R V N L D M G K 380 

CTT CTC CTC TTT TCT TAC ATC ATT CCC ATA AGT TTG CGT GTG AAC TTG GAC ATG GGC AAA 1140 



W 



M 



M . 



N. 



GCG GTG TAT GGA TGG ATG ATG ATG AAA GAT GAG AAC ATC CCT £GC ACG GTC GTT CGG ACC 

S T I P E E L G R L V Y L L T . D K T G T 
AGC ACT ATC CCA GAG GAA CTT GGG CGC CTG GTG TAT TTA TTG ACA GAC AAA ACA G*3A ACC 



400 
1200 

420 
1260 



L T Q N E M I F K R L H G T V S Y G A . 440 

CTb ^CC CAG AAT GAA ATG ATA TTT ^AG CGG CTG CAC CTG GGC ACC GTG TCC TAT GGC GCC lti20 

D T M DEI Q S H V R D S Y S Q M Q S Q 460 

GAC ACG ATG GAT GAG ATC CAG AGC CAT GTC AGG GAC TCC TAC TCA CAG ATG CAG TCT CAA 1380 

A G G N.N T G S T P L R K A Q S S A P K 480 

GCT GGT GGA AAC AAT ACT GGT TCA ACT CCA CTA AGA AAA GCC CAA TCT TCA GCT CCC AAA 1440 

V R K S V S S R I H E- X V K A I VL C.H 500 

GTT AGG AAA AGT GTC AGT AGT CGA ATC CAT GAA GCC GTG AAA GCC ATC GTG CTG TGT CAC 1500 

N Y T P V Y E S R A G V T Ei E T E F A E 520 

AAC GTG ACC CCC GTG TAT GAG TCT CGG GCC GGC GTT ACT GAG GAG ACT GAG TTC GCA GAG 1560 

A D q D F S D E N R T Y Q A S S P D E V 540 

GCT GAC CAA GAC TTC AGT GAT GAG AAT CGC ACC TAC CAG GCT TCC AGC CCG GAT GAG GTC 1620 

A L V Q W T E S V G L T L V S R D L T S 560 

GCT CTG GTG CAG TGG ACA GAG AGT GTG GGC CTC ACG CTG GTC AGC AGG GAC CTC ACC TCC 1680 



M Q L K T P S G Q V L S F C I h Q L F P 580 

ATG CAG CTG AAG ACC CCC AGT GGC CAG GTC CTC AGC TTC TGC ATT CTG CAG CTG TTT CCC 174 0 

F T S E S K R M G V I V R D E.S TA E I 600 

TTC ACC TCC GAG AGC AAG CGG ATG GGC GTC ATC GTC AGG GAT GAA TCC ACG GCA GAA ATC 1800 



T F y M K G A D V A M S P I V Q Y N D W 

ACA TTC TAC ATG AAG GGC GCT GAC GTG GCC ATG TCT CC£ ATC GTG CAG TAT AAT GAC TGG 

L E E E C G N M A R EG. L* R T L V V A K 

CTG GAA GAG GAG TGC GGA AAC ATG GCT CGC GAA GGA CTG CGG ACC CTC GTG GTT GCA AAG 

K A l T E E Q Y Q D F E S R Y T Q. A K L 

AAG GCG TTG ACA GAG GAG CAG TAC CAG GAC TTT GAG AGC CGA TAC ACT CAA GCC AAG CTG 

S M H D R S L K V A A V V E S L E R E M 

AGC ATG CAC GAC AGG TCC CTC AAG GTG GCC GCG GTA GTC GAG AGC CTG GAG AGG GAG ATG 



E M L R N A G I K I W M L T G D K L E T 
GAG ATG CTG CGC AAC GCC GGG ATC AAG ATA TGG ATG CTA ACA GGC GAT AAA CTC GAG ACA 



720 
2160 
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A T C I A K ' S S H L V* S ' R T Q D I* H I . F 740 

GCT ACC TGC ATT GCC AAA AGT TCA CAT CTC GTG TCT AGA ACA CAA GAT ATT CAT ATT TTC 2220 

r Q V T i S R G E A H L' E L N A F R R K H 760 

AGA CAG GTA ACC AGT CGG GGA GAG GCA CAT TTG GAG CTG AAT GCA TTT CGA AGG AAG CAT 2280 

D C A L V I S G D S L E V C L K Y Y E H 780 

GAT TGT GCA CTA GTC ATA TCT GGG GAC TCT CTG GAG GTT TGT CTA AAG TAC TAC GAG CAT 234 0 

E F V E Lr A C ' Q C P " A V 'V C C R C S P ^ T 800 

GAA TTT GTG GAG CTG GCC TGC CAG I TGC CCT GCC GTG GTT TGC TGC CGC TGC TCA CCC ' ACC 24 0 0 

Q K A R I 'V T L h Q Q H T G R R T C A I 820 

CAG AAG GCC CGC • ATT GTG ACA CTG CTG CAG CAG CAC ACA GGG AGA CGC ACC TGC GCC ATC 24 60 

G D G G N D* V S M I Q A A, D C G I G I E 840 

GGT GAT GGA GQA AAT .GAT GTC AGC ATG ATT CAG GCA GCA GAC TGT GGG ATT GGG ATT GAG 2520 

G K E G K Q A S L» A A D . F S I T Q F R H 860 

GGA AAG GAG GGT * AAA CAG GCC TCG CTG GCG GCC GAC TTC TCC ATC |ACG CAG TTC CGG CAC 2580 

I G R L L M V H G R N S Y K R S A A L G 880 

ATA GGC AGG CTG CTC ATG GTG CA.C fe^G CGG AAC AGC TAC AAG AGG TCG GCG GCA CTC GGC - 2640 

Q F V M H'R.G L I I S T M Q A V F S S V 900 

CAG TTC GTC ATG CAC AGG GGC CTT ATC ATC TCC ACC ATG CAG GCT GTG TTT TCC TCA GTC 2700 

F Y F A S V P X Y Q G F L M V G Y A T I 920 

TTC TAC TTC GCA TCC GTC CCT TTG TAT CAG GGC TTC CTC ATG GTG GGG TAT GCC ACC ATA 2760 

Y T M F P V F S L V L D Q D V K P E M A 94 0 
TAC ACC ATG TTC CCA GTG TTC TCC TTA- GTG CTG GAC ' CAG GAC GTG AAG CCA GAG ATG GCG 2820 

M L Y P E L Y K D L T K G R S L S F K T 960 

ATG CTC TAC CCG GAG CTG TAC AAG GAC CTC ACC AAG GGA AGA TCC TTG TCC TTC AAA ACC 2880 

F L I W V L I S I Y Q G G I L M Y G A D 980 

TTC CTC ATC TGG GTT TTA ATA AGT ATT TAC CAA ' GGC GGC ATC CTC ATG TAT GGG GCC CTG 2940 

V L F E S E F V H V V A I S F T A L I li 1000 
GTG CTC TTC GAG TCT GAG TTC GTC CAC GTG GTG GCC ATC. TCC TTC ACC GCA CTG ATC CTG 3000 

T E L L M V A L T V R T W H W L M V V A 1020 

ACC GAG CTG CTG ATG GTG GCG CTG ACC GTC CGC ACG TGG CAC TGG CTG ATG GTG GTG GCC 3060 

E F L S L G C Y V S S L A F U N E Y F D 1040 

GAG TTC CTC AGC TTA GGC TGC TAC GTG TCC TCA CTC GCT TTT CTC AAT GAA TAT TTT GAT 3120 

v A F I T T V T F L W K V S A I T V V S 1060 

GTT GCC TTT ATC ACC ACC GTG ACC TTC CTG TGG AAA GTG TCG GCG ATC ACC GTG GTC AGC 3180 

c h p L Y V L K Y L R R K S S P P S Y C 1080 

TGC CTC CCG CTG TAT GTC CTC AAG TAC CTG AGG CGC AAG TCT TCT CCT CCC AGC TAC TGC 3240 

K L A S J085 
AAG CTG GCC TCC TAA 

GGGGCTGTGCACCCCCAGCGGGCTGGCCCCAGCACCTTCTGCCCTTCCCAGCA 

FIGURE 32C 
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GGTTTGCCATTGOTACCAAGCAAGCACCACAAGAAAGGGAGGGTAC^ 

• - . | ■ . , r 

GCCTCTCCTTCTCAGTGCAGGGA 

agtgcgag<^ttcacccctgccaggcaagcccagggcatagatgctgagacagcCtctccctctcagtgcagggacgtc 

ACCGCTGCCAGGCAAGCCCAGGGCACAGAGGCCGGGACGGCCTCTCCCTCTCAGTGTGAGGCTTCACCCATGCTAGGCA 

AGCCCAGGGCACAGATGCCGGGATGGCCCCTCCCTCTCAGTGCGGGAACGTCACCCCTGCCAGGCAAGCCCAGGGCACA 

GATGCTGCGATGGCCTCTTCCTCTTAAGTGTGGGGCCrrCACCCCTGCTTTTCTTTCT 

ATTTCCATATTGAAGCAGCTTGAGtTTCTACTGAAAATGAGCCCGAATTATT^ 

ACTCTGGCATTCTGAGAATTAGACT^GAAAGTTTAATTTCTGCAGTTCCCTCATATTCAGATTC 

ACACAAAGTCATTCfcTACTCAAATGTAATAAAATTGAGGCTCCACGGAGAAAAAAAAAAAAAAAAAAAAA 



BNSDOCID: <WO 02055701 A2_l_> 
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Protein Family / Domain Matches, HMMer version 2 



Searching for complete domains in PFAM 

hmmpfam - search a single seq against HMM database 

HMMER 2.1.1 (Dec 1998) 1 1 ' r 

Copyright^ (C) 1992-1998 Washington University School of Medicine 
HMMER is freely distributed under the GNU General Public License (GPL) 



HMM file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pfam6 .4/Pf am 
/prod/ddm/wspace/orf anal/oa-script . 163 15 . seq 



Query: 67084FL f . , 

I 

Scores for sequence family classification {score includes all domains) : 



Model 



Hydrolase' 

1 * , • 

El^E2_ATPase E1-E2 ATPase 
2 



Description 
haloacid dehalocjenase-like hydrolase 



Score 
19.2 
15.8 



E-value N 
0.0051 
0.00087 



Parsed for domains: 

Model . • Domain seg-f seq-t 



El-E2_ATPase 
El-E2_ATPase 
Hydrolase 



1/2 
2/2 
1/1 



171 
27-? 
410 



199 
843 



hmm.- f hrom- 1 

1 . _ . 

42 70 . . 
105 133 . . 
1. 184 [] 



store 'E-value 



3.0 
13 .0 
19. !2 



6.9 
0. 0064 
0.0051 



Alignments of top- scoring domains: • 
El-E2_ATPase: domain 1 of 2, from 17*1 to 199: score 3.0, E = 6.9 
* - >keeeipaeeLvpGDiVevkpGdrVPADgr< - * 
+ ++++++++++GD+++V+ r+P D++ 
67084FL 171 GKVQVKSSDIQVGDLI IVEKNQRIPSDMV 199 

El-E2_ATPase: domain 2 of 2, from 277 to 305: 'score 13.0, E = 0.0064 
* - >lergnmVf aGTlwsGsltgvVtatGddT<- * 
1 + n+++a+T+v sG+ +gvV+ tG++T 
67084FL 277 LS I ENTLWASTIVASGTVIGWI YTGKET 305 

Hydrolase: domain 1 of 1, from 410 to 843: score 19.2, E - 0.0051 

+ ->ikawFDkDGTLtdgkeppiaeaivealrelgl apleevekl 

+ ++ Dk+GTLt+ + i + + ,+g ++ +++ ++ ++ 

67084FL 410 LVYLLTDKTGTLTQ- -NEMIFKRLHLGTVSYGAdtmdelQSHVRDSY 454 

lgrgl . g . . . erilleggltaell ........... 

+++++++++++ + + ++r+-f +++4-. + ++ ..+++ ++ + + + ++ 

67084FL 455 SQMQSqAggnntgstpLRKAQSSAPKVRKSvssriheavkaivlchnvtp 504 



+++ + +++++ + +■ + + + + +++++ 



+++ + + 



67084FL 505 vyesragvteetef aeadqdf sdenrtyqasspdevalvqwtesvgl tlv 554 



+ + + ++ +++++ + + + + + + + +++++ + + 

67084FL 555 srdl tsmqlktpsgqvlsf cilqlf pf tseskrmgvivrdestaei t f ym 604 



++ + ++ ++ +++ ++ ++ + ++ ++ +++ + +++ 

67084FL 605 kgadvamspivqyndwleeecgnmareglrt lwakkal teeqyqdf esr 654 

Id . evlgl ial . dklypgarealkaLk 

+ + + + +++ + ++ + + + . e+1+,1 ++d+l ++r+ + l+ L+ 

67084 FL 655 ytqakl smhdrslkvaawesleREmELLCLTGVeDQLQADVRPTLEMLR 704 
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erGikvailTngdr .naealle . . ; ; 

+Gik+++1T++ ++a+ ++++++ +++++•+ ++ +++++ + + + 
67084FL 705 NAG I KI WMLTGDRLfcTATClAKs s hi vs r t'qd i h i»f r c^vt s rgea h 1 el n 754 

algla . I fdaivdsdevggvgpwvgKPkpe 

+++ + + ++++ + +1 + +-h++ +++++. +vv+ + +p 
67084FL 755 a f rrkhdcal vi sgds 1 e vCLK- YyEHEFVEIACQCP AWCCRCSPT 800 

i f 1 1 a 1 e r 1 gvkpeevgpkvl mvGDg inDapa 1 aaAGvgvamgngg< - * 
+ +++ 1+ + ++++GDg nD+ ++ aA++g+ + 
67084 Fh 801 QKAR I VTLLQQHTGRR ■ TCAI GpjGGNDVSM IQAIADCGIG I EGKE '843 
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JSTAL/ W (1.74) multiple sequence alignment 



n67084FL 
T2B 



n€7084FL 
T2B 



h67084FL 
T2B 



h67084FL 
T2B 



>h6706i4F 1 L 
,T2B 



>h67084FL 
wT2B 



>h67084FL 
VT2B 



Dh67084FL 
VT2B 



bh67084FL 
AT2B 



bh€7084FL 
AT2B 



bh€7084FIi 
iAT2B 



• , bh67084FL 
1AT2B 



i*bh67084FL 
nAT2B 



?bh67084Fl* 
TAT2B 



Fbh67084FL 



MPLMMSEEGFENEESDYHTLPRARIMQRKR^ 
MPLMMSEEGFENDESDYHTLPRARITRRKJ^ 

************.************ : **********;********** .*.*** :*;♦*** 



LKARTVWLGCPEKCEEKHPRNS I KNQKYNVFTFI PGVLYEQFldFFLNLYFLVISCSQFVP j 
LKARTVWLGCPEKCEEKHPRNS I KNQKYNVFTFI F ^VLYEOFKIFFLNLYFLVVSCSIQ FVP 
******** * * * *****************************************:* ** * * * * 




OVGDLI I VEKNQRI PSDMVFLRTSEKAGSCFlrfltxE* 
QVGDLIIVEKNQRIPSDMVFLRTSEKAGSCFIIql^ 

**************** ******************* ******************:****** 

LFSISAYVYAQKPQMDIHSFEGTFTREDSDPPIHESLSIENTLWASTIVASGTVIGVVIY 
LFS I SAYVYAQKPQLDI HS FEGTFTREDSDPPI HES bS I ENTLWASTI VASGTVI GWI Y 
**************.********************************** *********** 

Tf^3 - 

TGKETRSVMNTSNPKNKVGLLDLELNRLTI^ 



******** ******.**■*********:**********- **:****■****. *******' PTS 

LLLFSYI I PI SLRM NLmjGKAVYGWMMMKDENI PGTWRTSTI PEELGRLVYLL1PKTGT 
LjIiLFSYI I PI gT.RVWLDMQ KAAYGWMIMKDENI PGTWRTSTI PEELGRLVYLLT feCTGT 
******************* ** .**** .********************************* 

LtVdNEMI FKRLHLGTVSYGADTMDE I QSHVRDS YSQMQSQAGGNNTGSTPLRKAQSSAPK 
1,-rtoNEMVFKRLHLGTVSYGTDTMDEIQSHVLNSYLQVHSQPSGHNPSSAPI.RRSQSSTPK 
^**?;.*V*********:********** :** *::**--*:*..*:***: :**•:** 

VRKSVSSRIHEAVKAIVLCHNVTPVYESRAGVTEETEFAEADQDFSDENRTYQASSPDEV 
VKKS VS S RI HEAVKA I ALCHNVTPVYE ARAG I TGETE FAE ADQDFS DENRT YQAS S PDEV 
™;** ********** :***:* ************************** 

ALVQWTESVGLTLVSRDLTSMQLKTPSGQVLSFCILQLFPFTSESKRMGVIVRDESTAEI 
ALVRWTESVGliTLVSRDLASMQLiKTPSGQVliTYCILQMFPFTSESKRMGI IVRDESTAEI 
V** *************** : ************ : : ****:*********** :********** 

TFYMKGADVAMSPIVQYITOWLEEECGNMAREGLRTLVVAKKALTEEQYQDFESRYTQAKIi 
TFYMKGADVAMSTIVQYNDWLEEECGNMAREGLRTL.WAKRTLTEEQYQDFESRYSQAKL 
************.***************************: :*************:**** 

SMHDRSLKVAAWESLEREMELLCLTGVEDQLQADVRPTLEMLRNAGI KI WMLTGDKLET 
q^RALKVAAVVESLEREMELLCLTGVEDQLQADVRPTLEMLRNAGIKIWMLTGDKLET 

ATCI AKSSHL.VSRTQDIHI FRQVTS RGEAHLELiNAFRRKHDCAIiV I SGDSLEVCLKYYEH 
ATCI AKS SHL.VSRTQDIHVFRPVTSRGEAHLEIJJAFRRKHDCALVI SGDSLEVGLRYYEH 
******************:** ********************************* x**** 

EFVEIACQ^PAWCCRCSPTQKARIVTLIiQQHTC^^ 

EtiVELiACQCPAWCCRCS PTXKAH I VTLI<RQHTRKRTCfij^DGGNDV^4ipAADCGIGIE 
*T..*V**** ********** ** : *****:*** .************************* 



CKECKQAS UtAPF^ I tqpph t p.pt.t mvhrr WSYKRS AALGOFVMHR pLI I STM OAVK3S v 
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AT2B 



'bh67084FL 
AT2B 



*bh67084FL 
AT2B 



bh67084FL 
AT2B 



*bh67 084FL 
AT2B 



GKEGKQASiAADFsiTQFRHIGRLIiM\ra^ 

. ** 1W . r ******** 

jD^SVPLYQ<pl>rVGYATIYTME??V^ 

FY FAS VPLYO qFIJ4VG YAT I YTM F P VF S LvElX ^DVKPlEMAI LY PEI»YKDI/TKf3 P g T^S FKT 
************************** * * ****** *V* * * * ************ ******** 

FLI WVL I S I YQGG I LMYGALVLFjBS EFVHf/VA I S FTAL I LTE LLMVAI^VRTWI^ERVvS 
FLI WVLI S I YOGGILMYGAI/ LLFED ^FVhWaI S FTAJj I LTE LLXVAI/ffr l RTWHwfr iMwa 



********************.*** : * * * **************** **** .********** 



EFLSLGCYVSSLAFI^EYF D VT^FITT VTFLW KV S A I TWS CL PL YV i|K Y 

r l5jEYFG I GRVS FGAFLDVA Ffe 



EFLSLGCYVASLAFbFjE YFGIGRVS FGAFLPVAF uTTVTFLWKVSAITVVSCLPLYVT KY 
*********-********* ****************************** 

LRRKSSPPSYCKLAS 1 

LKRKLSPPSYSKLSS 
*.** ***** **.* 
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Input file Fbh67084alt; Qutputf File Fbh67084alt . tra 
Sequence length 4231 

GGAGTCGACCCACGCGTCCGCATTGAGACAATGCCTCCACA^TACTTGATt^ 

■» . 

AATCACCATTATAGTTTCTGACAAATTGTTCTCAAAAAGGTACC 
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I V M V T L Q G F V G P W Y R N L F R F ' 360 

ATT GTT ATG GTA ACC TTA CAA GGA TTT GTG GGT CCA TGG TAC ,CGC. AAT CTT TTT CGG TTC 1080 

L L L ' F S Y I I P I S L R, V N L D M G K 3 80 

CTT CTC CTC TTT TCT TAC ATC ATT CCC ATA AGT TTG CGT GTG AAC TTG GAC ATG GGC AAA 114 0 

A V Y G W M M M K D E N I P G T V V R T 4 00 

GCG GTG TAT GGA TGG ATG ATG ATG AAA GAT GAG AAC ATC CCT GGC ACG GTC GTT CGG ACC 1200 

S T I P E E I* ' G R L V . " Y L L , T D K T G T 420 

AGC ACT ATC CCA GAG GAA CTT GGG CGC CTG GTG TAT TTA TTG ACA GAC AAA ACA GGA ACC 12 60 

L T Q N E M I F K R L H L G TVS Y G A 440 

CTC ACC CAG AAT GAA ATG ATA TTT* AAG CGG CTG CAC CTG GGC ACC" GTG TCC TAT GGC GCC 1320 

I • ' ■'■.•"■»• 

D TiM 0 E I Q S ..H V R D S Y S Q M Q S Q 460 

GAC ACG ATG GAT GAG * ATC CAG AGC CAT GTC AGG GAC TCC TAC TCA CAG ATG CAG TCT CAA 13 8 0 

A G G N.N T G S T P L R K A Q S S A P K 480 

GCT'GGT GGA AAC AAT ACT GGT TCA ACT CCA CTA AGA AAA GCC CAA TCT TCA GCT CCC AAA 1440 

V R K S V S S R I H E A V K A I V L C H . 500 

GTT AGG AAA AGT GTC AGT AGT CGA ATC CAT GAA GCC GTG AAA GCC ATC GTG CTG TGT CAC 1500 

N V T P V Y E S R A G V T E E T E F A E 520 

AAC GTG ACC CCC GTG TAT GAG TCT CGG GCC GGC GTT ACT GA(3 GAG ACT GAG TTC GCA GAG 1560 

A D Q D F S D E N R T Y Q A S S P D E V 54 0 

GCT GAC CAA GAC TTC AGT GAT GAG AAT CGC ACC TAC CAG GCT TCC AGC CCG GAT GAG GTC 1620 

A L V Q W T E S V G L T L V S R D L T S 560 

GCT CTG GTG CAG TGG ACA GAG AGT GTG GGC CTC ACG CTG GTC AGC AGG GAC CTC ACC TCC 1680 

M O L K T P S G Q V h S F C I L Q L F .P 580 

ATG CAG CTG AAG ACC CCC AGT GGC CAG GTC CTC AGC TTC TGC ATT CTG CAG CTG TTT CCC 1740 

F T S ES K RMGV IVRDESTAE I 600 

TTC ACQ TCC GAG AGC AAG CGG ATG GGC GTC ATC GTC AGG GAT GAA TCC ACG GCA GAA ATC 1800 

T F Y M K G A D V A M S P I V Q Y N D W 620 

ACA TTC TAC ATG AAG GGC GCT GAC GTG GCC ATG TCT CCt ATC GTG CAG TAT AAT GAC TGG 18 60 

L E EEC G NMAR E GLRTLVVAK 640 

CTG GAA GAG GAG TGC GGA AAC ATG GCT CGC GAA GGA CTG CGG ACC CTC GTG GTT GCA AAG 1920 

K A L T E E Q Y Q D F E S R Y T Q, A K L, 660 

AAG GCG TTG ACA GAG GAG CAG TAC CAG GAC TTT GAG AGC CGA TAC ACT CAA GCC AAG CTG 1980 

S M H P R S L K V A A V V E S L E R E M 680 

AGC ATG CAC GAC AGG TCC CTC AAG GTG GCC GCG GTA GTC GAG AGC CTG GAG AGG GAG ATG 2040 

E L L C L T G V E D Q b Q A D V R P T L 700 

GAA CTG CTG TGC CTC ACC GGC GTG GAG GAC CAG CTG CAG GCA GAC GTG CGG CCC ACG CTG 2100 

E M L R N A G I K I W M L T G D K L E T 720 

GAG ATG CTG CGC AAC GCC GGG ATC AAG ATA TGG ATG CTA ACA GGC GAT AAA CTC GAG ACA 2160 
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A"» T C I A K S S H L V S R T Q D I H I F 740 

GCT ACC TGC ATT GCC AAA AGT TCA CAT CTC GTG TCT AGA ACA CAA GAT ATT CAT ATT TTC 2220 

R Q V , T S R G E A H L E L .' N A * F R R K H 7 6 0 

AGA CAG GTA ACC AGT CGG GGA GAG GCA CAT . TTG GAG CTG, AAX GCA TTT CGA AGG AAG CAT 2280 

D C A L V I S G D S L E V C L K Y Y E H 780 

GAT TGT GCA CTA GTC ATA TCT GGG GAC TCT CTG GAG GTT TGT CTA AAG TAC TAC GAG CAT 23 40 

E F V E L . A C . . 0 C P A V V * C C R C S F . T 800 

GAA TTT GTG GAG CTG GCC TGC ' CAG TGC CCT GCC GT£ GTT TGC \ TGC CGC TGC TCA CCC ACC 24 00 



K 



H 



CAG AAG GCC CGC ATT GTG ACA CTG CTG CAG CAG CAC ACA GGG AGA CGC ACC TGC GCC ATC 



820 
2460 



G , D G G N D V S M I Q A A D C G I G I E , 84 0 

GGT GAT GGA GGA AAT GAT GTC AGC ATG ATT CAG GCA GCA GAC TGT GGG ATT GGG ATT GAG 2520 

G K E G K Q A S L A A D F S I T Q F R H 860 

GGA AAG GAG GGT AAA CAG GCC TCG CTG GCG GCC GAC TTC TCC ATC ACG CAG TTC CGG CAC 258 0 

I G R L L M V H G R N S Y K R S A A L G 880 

ATA GGC AGG CTG CTC ATG GTG CAC GGG CGG AAC AGC TAC AAG AGG TCG GCG GCA CTC GGC 2 64 0 

Q F V M H R G L I I S T M 0 A V F S S V 900 

CAG TTC GTC ATG CAC AGG GGC CTT ATC ATC TCC ACC ATG CAG GCT GTG TTT TCC TCA GTC 27 00 

i 

F Y F A S V P L Y Q G ' F h M V G YA T I 920 

TTC TAC TTC GCA TCC GTC CCT TTG TAT CAG GGC TTC CTC ATG GTG GGG TAT GCC ACC ATA 27 60 

Y T M F P V F S L V L D Q D V K P E M A 94 0 

TAC ACC ATG TTC CCA GTG TTC TCC TTA GTG CTG GAC CAG GAC GTG AAG CCA GAG ATG GCG 282 0 



M L Y P E L Y K D L T K G R S L S F K T 
ATG CTC TAC CCG GAG CTG TAC AAG GAC CTC ACC AAG GGA AGA TCC TTG TCC TTC AAA ACC 

F L I W V L I S I Y Q G G I L M Y G A L 
TTC CTC ATC TGG GTT TTA ATA AGT ATT TAC CAA GGC GGC ATC CTC ATG TAT GGG GCC CTG 

V L F E S E F V H V V A I S. F T A L I L 
GTG CTC TTC GAG TCT GAG TTC GTC CAC GTG GTG GCC ATC TCC TTC ACC GCA CTG ATC CTG 

T . E L L M V A L T V R T W H W L M V V A 
ACC GAG CTG CTG ATG GTG GCG CTG ACC GTC CGC ACG TGG CAC TGG CTG ATG GTG GTG GCC 

E F L S L G C Y V S > S L A F L N E Y F G 
GAG TTC CTC AGC TTA GGC TGC TAC GTG TCC TCA CTC GCT TTT CTC AAT GAA TAT TTT GGT 

I G R V S F G A F L D V A F I T TV T F 
ATA GGC AGA GTG TCT TTT GGA GCT TTC TTA GAT GTT GCC TTT ATC ACC ACC GTG ACC TTC 

LWKV S AITVVSCLP LYVLKY 
CTG TGG AfAA GTG TCG GCG ATC ACC GTG GTC AGC TGC CTC CCG CTG TAT GTC CTC AAG TAC 

L RRKS S P PSYCKLAS * 
CTG AGG CGC AAG TCT TCT CCT CCC AGC TAC TGC AAG CTG GCC TCC TAA 

GGGGCTGTGCACCCCCAGCGGGCTGGCCCCAGCACCTTCTGCCCTTCCCAGCACCTTGT 
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GGTTTGCGATTGCTACCAAGCAAGCACCACAAGAAAGGGAGGGTACGCCAGGCGAGCCCAGGGCAGAGATGCTGAGACA 

I , r 

GCCTCTCCTTCTCAGTGCAGGGACGTCACCCCTGCCAGGC^GCCCAGGGCACAGATGCCAGGATGGCTTC 

AGTGCGAGGCTTCACCCCTGCCAGGCAAGGCCAGGGCATAGATGCTGAGACAGCC 

ACCCCTGCCAGGCAAGCCCAGGGCACAGAGGCCGGGACGGCCTCTCCCTCTCAGTGTGAGGCTTCACCCATGCTAGGCA 

i 

AGCCCAGGGCACAGATGCCGGGATGGCQCCTCCCTCTCAGTGCGGGAACGTCACCCCTGCCAGGCAAGCCCiAGGGCACA 

I. * ' ' • ' , 

GATGCTGCGATGGCCTCTTCCTCTTAAGTGTGGGGCCTCACCCCTGCTTTTCTTTCTTT^ 

i " .. . . 

atttccatattgaagcagcttgagtttgtactgaaaatgagcccgaattatttc 

actctggcattctgagaattagadtgaaagtttaatttctgcagttccctcatattc 

acacaaagtcattCctactcaaatgtaataamttgaggctc 
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HMMER 2.1.1 (Dec 1938) I i , r 

Copyright (C) 1992-1998 Washington University School qf Medicine 
HMMER is 'freely distributed under the GNU General Public License (GPL) - . 



HMM file: 
Sequence file: 



/prod/ddm/seqanal/PFAM/pfam6.4/Pfam 
/prod/ddm/wspace/orfanal/oa-script .17118 .seq 



Query: 67084alt 

Scores for sequence family' classification (score includes all domains) 
Model 



Hydrolase 
1 

El-E2_ATPase E1-E2 ATPase 
2 



Description 
haloacid dehalogenase-like hydrolase 



Score 
19.2 
15.8 



E-value N 
0.0051 
0.00087 . 



Parsed for domains : 

Model Domain seq- f seq-t 



hmm-f hmm-t 



score i E -value 



El-E2_ATPase 
El-E2_ATPase 
Hydrolase 



1/2 
'2/2 
1/1 



171 199 
277 | 305 
4l6 843 



" - - r 
42 
105 
1 



.70 
133 . . 
184 [) 



3.0 
13 .0 
19 J2 



6.9 
0 .0064 
0 .0051 



Alignments of top- scoring domains: 

El-E2_ATPase: domain 1 of 2, from 171 to 199: score 3.0, E m 6.9 
* - >keeeipaeeLvpGDiVevkpGdrVPADgr< - * 
+ ++++++++++GD+++V+ r+P D++ 
67084alt 171 GKVQVKSSDIQVGDIiI I VEKNQRI PSDMV 199 

El-E2_ATPase: domain 2 of ? , from 277 to 305:. score 13.0, E = 0.0064 
* - >lergnmVfaGTlwsGsl , tgvVtatGddT<r-* 
1 + n+++a+T+v sG+ +gvV+ tG++T 
67084a! t 277 LSIENTLWASTIVASGTVIGWIYTGKET 305 

Hydrolase: domain l of l, from 410 to 843: score 19.2, E a 0.0051 

*->ikawFDkDGTLtdgkeppiaeaivealrelgl. .'. . .apleevekl 
+ ++ Dk+GTLt+. + i + + +g ++ +++ ++ ++ 

67084alt 410 LVYLLTDKTGTLTQ- - NEMI FKRLHLGTVS YGAdtmde IQSHVRDS Y 454 

Igrgl.g. erilleggltaell.,;. . 

+++++++++++++++r++++++ ++ + ++ ++ 

67084alt 455 SQMQSqAggnntgstpLRKAQSSAPKVRKSvssriheavkaivlchnvtp 504 



+ + + + + + + + + + 4- + + + + + + + 



+++ + + 



67084alt 505 vyesragvteetef aeadqdf sdenrtyqasspdevalvqwtesvgl tlv 554 



+++ ++ +++++ + + +++++ + + + +++ + + + 

67084alt 555 srdl tsmqlktpsgqvlsf cilqlf pf tseskrmgvivrdestaeitf ym 604 



++ + ++ ++ +++ ++ +++ ++ ++ +++ + + ++ 
6 7084alt 605 kgadvamspivqyndwleeecgnmareglrtlvvakkalteeqy-qdf esr 654 

- - Id.evlglial .dklypgarealkaLk 

+ + + + +++ + ++ + + + e+1+1 ++d+l +-t-r++l+ L+ 

67084alt 655 ytqaklsmhdrslkvaavvesleREmELLCLTGVeDQLQADVRPTLEMLR 704 
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erGikvailTngdr .naealle . . 

+Gik++ + +a.+ + + + + ++ + + + + + + 4- 4- A -I. J. ' _1_ > . 

67084a^t 705 NAG I KI WMLTGDKLeTATCl AKs shl vs r tqd i hi f rqv t s rgeah 1 el n 754 

algla . If daivdsdevggvgpvvvgKPkpe 

+ + + ++. + + + + ;■+ +1 + + + + + + + + + + +W+ + +p 

67064alt 755 af rrkhdcalvisgdslevCLK- YyEHEFVELAGQCP- - -AWCCRCSPT 800 

•• ' 

if llalerlgvkpeev^pkvlmvGDginDapalaaAGvgvamgngg<-* 
+ +++ II +. + ++++GDg nD+ aA++g+ +. 
67084alt 801 QKARIVTLLQQHTGRR TCAIGDGGNDVSMIQAADCGIGIEGKE 843 
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CLUSTXli W (1.74) multiple sequence alignment 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



F,bh67084alt 
mAT2B 



Fbh67984alt 
mAT2B 



Fbh67084alt 
mAT2B ' 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



Fbh67084alt 
mAT2B 



Ebh67084alt 
mAT2B 



Fbh67084alt 
tnAT2B 



Fbh67084alt 



M P LMMS EEGFENEESD YHTLPRAR I MQR KRGLEWFVCIX3WKFLCTSCCGWL INI CJRRKKE 
M PIJ^SEEGFENDESD YHTLPRAR I TRRKRGLEWFW 

.***;******* ********** ^ * * * * .*.*★** 



LKARTVWLGCPEKCEEKHPRNSI KNOKYNVFTFl PoVLYEQFKE^ ; L^LYFLVISCS QFVp3 
************************ *V* ************************* : ****** * 



ALK^GYI^ 

^[lI g^gylyt ywapl gfvlavtiar eaidefrrfqrpkemnsqlyskltvrgkvqvkssdi 

* * * * * * * * * * * * * * * * * * * * * : : * * ************* .***** * * * ** *********** . 



qvgdliiveknqripsdmvflrtsekagscfl) 
qvgdliiveknqripsdmvflrtsekagscfir} 



>qUdGETDWKLKVAVS CTQQLPA 
>QyDGETDWKLKVAVS CTQRL PALGD 



*****************************************************:****** 

LFSISAYVYAQKPQMDIHSFEGTFTREDSDPPIHESLSIENTLWASTIVASGTVIGVVIY 
LFS I S AYVYAQKPQLD I HS FEGTFTREDSDP P I HES LS I ENTLWASTI VASGTV I GW I Y 
* * ********* ***,********************************************* 

TGKETRSVMNTSNPKNKVGLLDLELN^ 
TfiKETRSVMNTSNPNNK^GIAiPLEL^^ 

**************-**** * * ***** • * * * * * * * * **. ** : * ******* m ********* *. . 



T iLLFSYI I PISLR\ ^LDMG KAVYGWMMMKDENI PGTWRTSTI PEELGRLVYLL^DKTGT 
LLIjFSYIIPI SLR\^LDM^ KAAYGWMIMKDENIPGTVVRTSTIPBE3^RIjVY1iLT iDKTGT 
*• ***★ »» »*****★ *** * * ** ,**** .********************************* 



QNEM I FKRLHLGTVS YGADTMDE I QS HVRDS YSQMQSQAGGNNTGSTPLRKAQSS APK 
LrdQNEMVFKRLHLGTVSYGTDTMDEIQSHVLNSYLQVHSQPSGHNPSSAPLRRSQSSTPK 
W*.* ** ; ************;******** ** ;** *;:**..*:*..*:***::***:** 

VRKSVSSRIHEAVKAIVLCHNVTPVYESRAGVTEETEFAEADQDFSDENRTYQASSPDEV 
VKKS VS SR I HEAVKAI ALCHNVT PVYEARAG I TGETEF AEADQD FS DENRTYQAS S PDEV 
* . ************** m ********** : *** : * ************************** 

ALVQWTESVGLTLVSRDLTSMQIiKTPSGQVLSFCILQLFPFTSESKRMGVIVRDESTAEI 
ALVRWTESVGLTLVSRDLASMQLKTPSGQVLTYGILQMFPFTSESKRMGIIVRDESTAEI 
*** .**************;************: : ****:***********:********* * 

TFYMKG ADVAMS P I VQYNDWLEEECGNMAREGIiRTLVVAKKALTEEQYQDFE SRYTQAKL 
TFYMKGADVAMSTIVQYNDWLEEECGNMAREGIjRTIiVVAKRTLTEEQYQDFESRYSQAKL 
************.*************************** : : ************* : **** 

SMHDRSLKVAAVVESLEREMELLCLTGVEDQIjQADVRPTLEMLRNAGI ki wmltgdklet 
SIHDRALKVAAVVESLEREMELLCIjTGVEDQLQADVRPTLEMLRNAGI ki wmltgdklet 
*.***.****************************************************** 

ATCIAKS SHLVSRTQDIHI FRQWSRGEAHLEL^^ sgdslevclkyyeh 
ATCIAKSSHLVSRTQDIHVFRPVTSRGEAHLELNAFI^^ 

****************** : ** **^**^******^^^t { ^T4^cyt*^* 

ef^lacqcpawccrcsptqkarivtli/^qhtgrrtc^gdg^ 

EL»VE1»ACQC P AWCCRCS PTXKAH I VTLLRQHTR^RTCWlJGIXXS^VSw 1 QAADCG I G I E 

*.****************** **;*****;***. .****V********W*** ******* 

GKEGKQASLAADFS I TQFRHI GRLLMVHGRNSYK^SAAl^FVMHI ^LI ISTMQ^^SSV 
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AT2B 



bh67084aj.t 
tAT2B 



bh67084alt 
AT2B 



■bh67084alt 
AT2B 



•bh67084alt 
AT2B . 



GKEGKQASLAADFS I TQFRHIGRLLMS^GRNSYKJfcSAJ ^ ISlTM OA^ggX/ 
*************** ** **************** * * * * * * ********* * * * * * * anrir? * 



* * * * * * ********************** * * * * * * ****** . ******************* 



FL.I WVLI SI YQGG I LM YGALVI^S EFVllv^ 
FIiTWVTiT fit YOOn T T,M YGALg XFEDEyWVV- 



'A IS FTALI LTELLMVA 
A I S FTAL I LTELLXVAL' 



LfrVRTWl&I^MVVA 
LTO RTWHW tLMVVA 



********************* * * * * * **'* * *.* ************* * * * * *********** 

7W*r to. .. ' 

E FLS LGC Y VS SLA FljsiE Y FG i GRVS ^GAFLDV^FITTVTFLWKVSAI IVVSCLPlTF^KY 
E FLS LGC Y VAS LA FLNfe Y FG I GRVS FG AFT .nVA^ TTVTFLWKVS A T TVVgr r ,py ^^Trv 



******** - * ********************** ********************* ****** 

t 

LRRKSSPPSYCKLAS 
LKRKLSPPSYSKLSS 
* . ** ***** ** . * 
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SEQUENCE LISTING 
<110> Millennium Pharmaceuticals, Inc. 

-~- Curtis, Rury A.J . ( t . f , ■ 

<120> 13099, ,46455, 54414, 53763, 6707*6, 67102, 44181, 

-". 67084FL, AND 67084 ALT, HUMAN PROTEINS AND' METHODS OF 
USE THEREOF 

<130> MNI-214CPPC ■ , 

<150> US 60/256,240 , » ' » . 

<151> 2000-12-15 . 

<150> US 60^256, 588 
<151> 2000-12-18 . \ 

<150> US 60/258,028 

<151> 2000-12-21 -* 
<160> 40 * , » 

<170> FastSEQ for Windows Version 4 . 0 

<210> 1 * ' 1 

<211> 2725 ' - 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (180) . . . (2033) • t ' ■ ' . 

<4 00> 1 

ccacgcgtcc ggccttccga aatagaaaca aagttggtca caaatcacat tagctttgcc 60 
cgaagttttt ccccacactc ttctttagca tgctattatg gggaaagtga ccactcctgg 120 
gagcgggggt ggtcggggcg gtttggtggc ggggaagcgg ctgtaacttc tacgtgacc 17 9 
atg gta cct gtt gaa aac acc gag ggc ccc agt ctg t ctg aac cag aag 227 
Met Val Pro Val Glu Asn Thr Glu Gly Pro Ser Leu Le"u Asn Gin Lys 
1 5 10 15 

ggg aca gcc gtg gag acg gag ggc age ggc age egg cat cct ccc tgg 275 
Gly Thr Ala Val Glu Thr Glu Gly Ser Gly Ser Arg His Pro Pro Trp 
20 25 30 

gcg aga ggc tgc ggc atg ttt acc ttc ctg tea tct gtc act get get 323 
Ala Arg Gly Cys Gly Met Phe Thr Phe Leu Ser Ser Val Thr Ala Ala 
35 40 45 

gtc agt ggc etc ctg gtg ggt tat gaa ctt ggg ate ate tct ggg get 371 
Val Ser Gly Leu Leu Val Gly Tyr Glu Leu Gly He lie Ser Gly Ala 
50 55 60 

ctt ctt cag ate aaa acc tta tta gcc ctg age tgc cat gag cag gaa 419 
Leu Leu Gin He Lys Thr Leu Leu Ala Leu Ser Cys His Glu Gin Glu 
65 70 75 80 

atg gtt gtg age tec etc gtc att gga gcc etc ctt gcc tea etc acc 4 67 
Met Val Val Ser Ser Leu Val lie Gly Ala Leu Leu Ala Ser Leu Thr 
85 90 95 
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gga ggg gtc ctg ata gac aga tat gga aga agg aca gca ate ate ttg 515 

Gly Gly Val Leu lie ( $sp Arg ,,Tyr Gly Arg Arg Thr Ala lie lie Leu . 
100 105 ..HO 

: » i , r • 

tea tec tgc ctg ctt gga etc gga age tta gtc ttg ate etc agt tta 563 

Ser Ser Cys » Leu Leu Gly Leu Gly Ser Leu Val Leu lie Leu Ser Leu . 

115 120 i25 

tec tac acg gtt ctt ata gtg gga cgc att gee ata ggg gtc tec ate 611 

Ser Tyr Thr Val Leu lie Val Gly Arg He Ala He iGly Val Ser He 

130 . 135 , 140 

I » * . ' • 

tec etc tct tec att gec act tgt gtt tac ate gca gag att get cct 659 

Ser Leu Ser Ser 'He Ala Thr Cys Val Tyr He Ala Glu He Ala Pro 
145 .150 155 160 

caa cac aga aga ggc ctt ctt gtg tea ctg aat gag ctg atg att gtc 707 

Gin His Arg Arg Gly Leu Leu Val Ser Leu Asn Glu Leu Met lie Val 

165 170 175 

ate ggc att ctt tct gec tat att tea aat tac gca ttt get aat gtt 755 

He Gly He Leu Ser Ala Tyr lie Ser Ash Tyr Ala Phe Ala Asn Val 
180 ■ ( , : 185 190 

• ,- • 

ttc cat ggc tgg aag tac atg ttt ggt ctt' gtg att ' ccc ttg gga gtt! 803 

Phe His Gly Trp Lys'Tyr Met Phe Gly Leu Val lie Pro Leu Gly Val 
195 200 - 205 

ttg caa gca att gca atg tat ttt ctt cct cca age cct egg ttt ctg 851 

Leu Gin Ala He Ala Met Tyr Phe Leu Pro Pro Ser Pro Arg Phe Leu 

210 215 220 

gtg atg aaa gga caa gag gga get get age aag gtt ctt gga agg tta 899 

Val Met Lys Gly Gin Glu Gly Ala Ala Ser Lys Val Leu Gly Arg Leu 
225 230 235 240 

aga gca etc tea gat aca act gag gaa etc act gtg ate aaa tec tec 94 7 

Arg Ala Leu Ser Asp Thr Thr Glu Glu Leu Thr Val, He Lys Ser Ser 

245 250 255 

ctg aaa gat gaa tat cag tac agt ttt tgg gat ctg ttt cgt tea aaa 995 

Leu Lys Asp Glu Tyr Gin Tyr Ser Phe Trp Asp Leu Phe Arg Ser Lys 

260 265 . . . 270 

gac aac atg egg ace cga ata atg ata gga eta aca eta gta ttt ttt 1043 

Asp Asn Met Arg Thr Arg He Met He Gly Leu Thr Leu Val Phe Phe 
275 280 285 

gta caa ate act ggc caa cca aac ata ttg ttc tat gca tea act gtt 1091 

Val Gin lie Thr Gly Gin Pro Asn He Leu Phe Tyr Ala Ser Thr Val 

290 295 300 

ttg aag tea gtt gga ttt caa age aat gag gca get age etc gee tec 1139 

Leu Lys Ser Val Gly Phe Gin Ser Asn Glu Ala Ala Ser Leu Ala Ser 
305 310 315 320 

act ggg gtt gga gtc gtc aag gtc att age acc ate cct gee act ctt 1187 

Thr Gly Val Gly Val Val Lys Val He Ser Thr He Pro Ala Thr Leu 

325 330 335 
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ctt gta gac cat gtc ggc age aaa aca ttc etc tgc att ggc tec tct 
Leu Val Asp His Val Gly Ser Lys Thr Phe Leu Cys lie Gly Ser Ser 
340 345 ! 350 



1235 



gtg atg gca get teg ttg gtg ace a'tg ggc ate* igta aat • etc aac ate 1283 
Val Met 'Ala Ala Ser Leu Val Thr Met Gly He Val* Asn Leu Asn He 
355 360 ' 365 



cac atg aac ttc acc cat ate tgc aga age cac aat tct ate aac cag 
His Met Asn Phe Thr His lie. Cys Arg Ser His Asn Ser lie Asn Gin 
370 375 1 • 380 , ; 



acc tea gca tec ttg eta aat get gga tta age cac act gaa tac cag 
Thr Ser Ala Ser Leu Leu Asn Ala Gly Leu Ser His. Thr Glu Tyr Gin 
435 440 445 



tta gee age ttg ctt gtt tat gtt get get ttt tea att ggt eta gga 
Leu Ala Ser Leu Leu Val Tyr Val Ala Ala Phe Ser He Gly Leu Gly 
465 470 475 480 

cca atg ccc tgg ctg gtg etc age gag ate ttt cct ggt ggg ate aga 
Pro Met Pro Trp Leu Val Leu Ser Glu lie. Phe Pro Gly Gly He Arg 
485 490 495 



etc ate teg ctg aca ttt ttg act gta, act gat ctt att ggc ctg cca 

Leu He Ser Leu Thr Phe Leu Thr Val Thr Asp Leu He Gly Leu Pro 
515 520 525 

tgg gtg tgc ttt ata tat aca ate atg agt eta gca tec ctg ctt ttt 

Trp Val Cys Phe lie Tyr Thr lie Met Ser Leu Ala Ser Leu Leu Phe 
530 535 540 

gtt gtt atg ttt ata cct gag aca aag gga tgc tct ttg gaa caa ata 

Val Val Met Phe He Pro Glu Thr Lys Gly Cys Ser Leu Glu Gin lie 

545 550 555 560 

tea atg gag eta gca aaa gtg aac tat gtg aaa aac aac att tgt ttt 

Ser Met Glu Leu Ala Lys Val Asn Tyr Val Lys Asn Asn He Cys Phe 
565 570 575 



1331 



tec ttg gat gag tct gtg att tat gga cca gga aac ctg tea acc aac 1379 

Ser Leu Asp Glu Ser Val He Tyr Gly Pro Gly Asn Leu Ser Thr Asn 
385 390 395 400 

ajac aat act etc aga gac cac ttc aaa ggg att tct tec cat age' aga. 1427 

Asn" Asn Thr Leu Arg Asp His Phe Lys Gly He Ser Ser His Ser Arg 
'405 410 415 

,agp tea etc atg ccc ctg aga aat gat gtg gat aag aga ggg gag acg 1475 

Ser Ser Leu Met Pro Leu Arg Asn Asp Val Asp Lys Arg Gly Glu Thr 

420 425 430 



1523 



ata gtc aca gac cct ggg gac gtc cca get ttt ttg aaa tgg ctg tec 1571 
He Val Thr Asp Pro Gly Asp Val Pro Ala Phe Leu Lys Trp Leu Ser 
450 ' 455 460 



1619 



1667 



gga cga gee atg get tta act tct age atg aac tgg ggc ate aat etc 1715 
Gly Arg Ala Met Ala Leu Thr Ser Ser Met Asn Tzj? Gly He Asn Leu 
500 505 510 



1763 



1811 



1859 



1907 
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atg agt cab cac caa gaa gaa tta gtg cca aaa cag cct caa a'aa aga 1955 

Met Ser His His Gin Glu Gdu .^eu Val Pro hys Gin Pro Gin Lys Arg . 

580 1 1 585 ' 590 . . 

aaa ccc cag gag cag etc ttg gag tgtaac aag ctg tgt ggt agg *ggc 2003 

Lys Pro Gin • Glu Gin Leu Leu Glu Cys ksn Lys Leu Cys Gly Arg Gly . 
595 600 '605 



caa tec agg cag ctt tct cca gag acc taa tggcctcaac accttctgaa 
Gin Ser Arg Gin Leu Ser Pro Glu Thr * , 
610 615 



cgtggatagt 
tgtgctctct 
cagectttaa 
ctaagtgatc 
ttaatacttt 
atcaacctcc 
acacaagtgg 
atettacgea 
gattaggaga 
taaagattat 
tatgaatttc 
aagggeggee 



gccagaacac 
tttcagtgtc 
^ccccctcct 
tctttttctig 
ctacctt'ctt 
gecttaaget 
tctggacatg 
aaaaagaacc 
gggtcct.ggc 
agagcttatt 
atgttaatat 
gc 



ttaggagggt 
atggaactgg 
ccccagaagg 
agcaggatat 
cacagagcag 
atgtatgtat 
agggtacagt 
agaacacaca 
taggatttta 
ttatgaacta 
tgtgaaatat 



gtctttggac 
ttttgaagag 
aacctcaaaa 
caggttaaaa 
cctttgaata 
ggaggccagt 
ttctgcctac 
gtgtggacaa 
gtgg.taattc 
taaactataa 
taaaataatt 



caatgeatag 

acactctgaa 

ggtagatgag 

aaaaaaagtt 

gactatgtcc 

cgcagcttta 

caagacacta 

ctgcccatat 

ctagttacati 

tttaatgcaa 

ccgcaataaa 



ttgcgactcc 
atgataaaga 
gtacaaggtc 
actggctggt 
tagtgaagac 
ttatgeagae 
ettgeactgg 
attctatcta 
tcaacaagta 
aatatccttt 
aaaaaaaaaa 



2053 



* » 

2113 
217 3 
2233 
2293 
2353 
2413 
2473 , 
2 533 
2593 
2653 
2713 
2725 • 



<210> 2 

<211> 617 

<212> PRT 

<213> Homo sapiens 



<400> 2 






























Met 


Val 


Pro 


Val 


Glu 


Asn 


Thr 


Glu 


Gly 


Pro 


Set 


Leu 


Leu 


Asn 


Gin 




1 








5 










10 










15 




Gly 


Thr 


Ala 


Val 


Glu 


Thr 


Glu 


Gly 


Ser 


Gly 


Ser 


Arg 


His 


Pro 


Pro 


Trp 






20 










25 










30 






Ala 


Arg 


Gly Cys 


Gly 


Met 


Phe 


Thr 


Phe 


Leu 


Ser 


Ser 


Val 


Thr 


Ala 


Ala 






35 










40 










45 








Val 


Ser 


Gly 


Leu 


Leu 


Val 


Gly 


Tyr 


Glu 


LeO 


Gly 


lie 


( Ile 


Ser 


Gly Ala 




50 










55. 










60 










Leu 


Leu 


Gin 


lie 


Lys 


Thr 


Leu 


Leu 


'Ala 


Leu 


Ser 


Cys 


His 


Glu 


Gin 


Glu 


65 










70 










75 










80 


Met 


Val. 


Val 


Ser 


Ser 


Leu 


Val 


He 


Gly 


Ala 


Leu 


Leu 


Ala 


Ser 


Leu 


Thr 










85 










90 










95 




Gly 


Gly 


Val 


Leu 


He 


Asp 


Arg 


Tyr 


Gly 


Arg 


Arg 


Thr 


Ala 


He 


He 


Leu 








100 










105 










110 






Ser 


Ser 


Gys 


Leu 


Leu 


Gly 


Leu 


Gly 


Ser 


Leu 


Val 


Leu 


He 


Leu 


Ser 


Leu 






115 










120 










125 








Ser 


Tyr 


Thr 


Val 


Leu 


He 


Val 


Gly 


Arg 


He 


Ala 


lie Gly Val 


Ser 


He 




130 










135 










140 










Ser 


Leu 


Ser 


Ser 


He 


Ala 


Thr 


Cys 


Val 


.Tyr 


He 


Ala 


Glu 


lie 


Ala 


Pro 


145 










150 










155 










160 


Gin 


His 


Arg 


Arg 


Gly Leu 


Leu 


Val 


Ser 


Leu 


Asn 


Glu 


Leu 


Met 


lie 


Val 










165 










170 










175 




lie 


Gly 


lie 


Leu 


Ser 


Ala 


Tyr 


He 


Ser 


Asn 


Tyr 


Ala 


Phe 


Ala 


Asn 


Val 








180 










185 










190 






Phe 


His 


Gly Trp 


Lys 


Tyr 


Met 


Phe 


Gly 


Leu 


Val 


He 


Pro 


Leu 


Gly 


Val 






195 










200 










205 








Leu 


Gin 


Ala 


He 


Ala 


Met 


Tyr 


Phe 


Leu 


Pro 


Pro 


Ser 


Pro Arg 


Phe 


Leu 




210 










215 










220 










Val 


Met 


Lys 


Gly Gin 


Glu 


Gly Ala 


Ala 


Ser 


Lys 


Val 


Leu 


Gly 


Arg 


Leu 


225 










230 










235 










240 
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Arg Ala 


Leu 


Ser 


Asp 


Thr 


Thr 

A H JL 


Glu 


Glu 


Leu 


Thr 


Val 


lie 


Lys 


Ser 


Ser 










245 










250 










255 




M T on 


T.\/ <? 


Asp 


Glu 


Tyr 


Gin 




O Ci 


Phe 


Trp Asp 


Leu 


Phe Arg Ser 


Lys 








-2~6XT 










265 










270 






Asp 


Asn 


Met Arg Thr Arg 


lie 




lie 


Gly Leu' Thr Led Val Phe 


Phe 






275 










280 










285 








Val 


Gin 


He 


Thr 


Gly Gin 


Pro 


Asn 


lie 


Leu Phe Tyr Ala Ser Thr 


Val 




290 










295 










300 










Leu 


Lys 


Ser 


Val 


Glv 


Phe 


Gin 


Ser 


Asn 


Glu 


Ala 


Ala 


Ser 


Leu 


Ala 


Ser 


305 










310 










315. 










320 


Thr 


Glv 


Val Gly 


Val 


Val 


Lys 


Val 


ii4 


Ser* Thr 


lie. Pro 


Ala 


Thr 


Leu 










325 




















335 




Leu 


Val 


Asp 


n jlo 


Val 


Gly Ser 


Lys 


Thr 


jrne 


Leu 


Cys 


Tip 


Glv 

ox y 


Ser 


Ser 








J5 H U 










34 5 










350 






v a jl 


Met 


Ai.a 

*k ^ 
ODD 


HJia 


Ser 


Leu 


Val 
i 


Thr 
360 


Met 


r*T if 


He 


Val 


Asn 
365 


Leu 


As h 


lie 


Hi <; 


L id L. 


Asn 


rne 


Thr 


His 


lie 


Cys 


Arg 


Ser 


His ' Asn 


Ser 


T Vp 

jl jl e 


Ben 


Gin 


I 


370 










37 5 








380 










O o Y~ 


Leu 


Asp 


CalU 


Ser 


Val 


He 


Tyr 


Gly 


Pro 


.Gly 


Asn 


Leu 




i nr 


Asn 


385 










390 










395 










4 nn 


7H cm 


n o n 
no i i 


Thr 


Leu 


Arg Asp His 


Phe 


Lys 


Gly 


lie 


Ser 


Ser 


His 


Ser 


Arg 










4 05 










410 










415 






Q o r- 


Leu 


Met 
4 20 


Pro 


Leu 


Arg 




no £s 


Val 


Asp 


Lys 


Arg 


Gly 
,4 30 


o J-U 


Th r 


1 iiJ. 


Ser 


Ala 
435 


Ser 


Leu 


Leu 


Asn 


Ala 
A AC\ 


oiy 


Leu 


Ser 


His 


Thr 
445 


Glu 


Tyr 


o JL 11 


Tin 

jl xe 


v a jl 


Thr 


Asp 


Pro Gly 


Asp 


\7a 1 
vdl 


JTX. \J 


Ala 


Phe 


Leu 


Lys 


Trp 


Leu 






4 50 










455 










460 








Leu 


nia 


Ser 


Leu 


Leu 


Val 


Tyr 


Val 


Ala 


Ala 


Phe 


Ser 


He 


Gly 


beu 


Gly 


465 










470 










475 










/DA 


Pro 


Mot- 


Pro 


Trp 


Leu 


Val 


Leu 


Ser 


Glu 


He 


Phe 


Pro 


Gly Gly 


i le 


Arg 










485 










4 90 










495 






Arg 


Ala 


Met 
500 


Ala 


Leu 


Thr 


Ser 


Ser 

DKJD 


Met 


Asn 


Trp 


Gly 


He 
510 


Asn 


Leu 


Leu 


Tip 
lie 


Ser 
515 


Leu 


Thr 


Phe 


Leu 


J. 1 J J. 
tort 

D /L U 


vd J. 


Thr 


Asp 


Leu 


He 
525 


Gly 


Leu 


Prn 

c JLU 


irp 


v a X 

530 


Cys 


Phe 


He 


Tyr 


Thr 
535 


Tie 


Met 


Ser 


Leu 


Ala 
540 


Ser 


Leu 


Leu 


Phe 


V clx 




Met' 


Phe 


lie 


Pro 


Glu 


Thr 


Lys 


Gly 


Cys 


Ser 


Leu 


Glu 


Gin 


lie 


545 










550 










555 










JDU 






Glu 


Leu 


Ala 
565 


Lys 


Val 


Asn 


Tyr 


Val 
570 


Lys 


Asn 


Asn 


lie 


Cys 
575 


Php 




Ser 


His 


His 
580 


Gin 


Glu 


Glu 


Leu 


Val 
585 


Pro 


Lys 


Gin 


Pro 


Gin 
590 


Lys 


Arg 


Lys 


Pro 


Gin 


Glu 


Gin 


Leu 


Leu 


Glu Cys 


Asn 


Lys 


Leu 


Cys 


Gly 


Arg 


Gly 






595 










600 










605 








Gin 


Ser 
610 


Arg 


Gin 


Leu 


Ser 


Pro 
615 


Glu 


Thr 
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atg gta cct gtt gaa aac acc gag ggc ccc agt ctg ctg ..aac cag aag 48 
Met Val Pro Val Glu A'sn Thr Glu Gly Pro Ser, Leu Leu Ash Gin Lys 
1 5 .10 15 

ggg aea gee gtg gag acg gag ggc age ggc age egg cat cct ccc tgg 96 
Gly Thr Ala Val Glu Thr Glu Gly Ser Gly Ser Arg His Pro '• Pro Trp 
20 25 30 ' 

gcg aga ggc tgc ggc atg ttt acc ttc ctg tea tct gtc act get get 144 
Ala Arg Gly Cys Gly Met Phe Thr Phe Leu Ser . Ser Val Thr Ala Ala 

35 40 ' ' ' . ■ .45 ; 

• . - i » 

gtc agt ggc etc ctg gtg ggt tat gaa ctt ggg ate ate tct ggg get 192 

Val Ser Gly Leu Leu Val Gly Tyr Glu Leu Gly lie lie Ser Gly Ala 

50 5 ( 5 ... 60 . 

ictt ctt cag ate aaa acc tta tta gee ctg age tgc cat gag cag gaa 240 
Leu Leu Gin lie Lys Thr Leu /Leu Ala Leu Ser Cys His Glu Gin Glu 
65 * 70 75 80 

atg gtt gtg age tec etc gtc att gga gee etc ctt gee tea etc acc 288 
Met Val Val Ser Ser Leu Val lie Gly Ala Leu Leu Ala Ser Leu Thr 
85 90 95 . 

gga ggg g tc ct g ata g ac a g a tat gga aga agg aca gca ate ate ttg 336 
Gly Gly Val Leu lie Asp Arg Tyr Gly Arg Arg Thr Ala lie lie Leu 
100 ' 105 , 110 

tea tec tgc ctg ctt gga etc gga age tta gtc ttg ate etc agrt tta . 384 
Ser Ser Cys Leu Leu Gly Leu Gly Ser Leu Val Leu lie Leu Ser Leu 
.115 " 120 125 

tec tac acg gtt ctt ata gtg gga cgc att gee ata ggg gte tec ate 4 32 
Ser Tyr Thr Val Leu lie Val Gly Arg lie Ala lie Gly Val Ser lie 
130 135 140 

tec etc tct tec att gee act tgt gtt tac ate gca gag att get cct 480 
Ser Leu Ser Ser lie Ala Thr Cys Val Tyr lie Ala Glu lie Ala Pro 
14fe 150 155 160 

caa cac aga aga ggc ctt ctt gtg tea ctg aat gag ctg atg att gtc 528 
Gin His Arg Arg Gly Leu Leu Val Ser Leu Asn Glu Leu Met lie Val 
165 170 175 

ate ggc M att ctt tct gee tat att tea aat tac gca ttt gee aat gtt 576 
lie Gly lie Leu Ser Ala Tyr lie Ser Asn Tyr Ala Phe Ala Asn Val 
180 185 190 

ttc cat ggc tgg aag tac atg ttt ggt ctt gtg att ccc ttg gga gtt 624 
Phe His Gly Trp Lys Tyr Met Phe Gly Leu Val lie Pro Leu Gly Val 
195 200 205 

ttg caa gca att gca atg tat ttt ctt cct cca age cct egg ttt ctg 672 
Leu Gin Ala lie Ala Met Tyr Phe Leu Pro Pro Ser Pro Arg Phe Leu 
210 215 .220 

gtg atg aaa gga caa gag gga get get age aag gtt ctt gga agg tta 720 
Val Met Lys Gly Gin Glu Gly Ala Ala Ser Lys Val Leu Gly Arg Leu 
225 230 235 240 
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aga gca etc tea gat aca act gag gaa etc act gtg ate aaa t:cc tec 7 68. 

Arg Ala Leu Ser Asp ( Thr Thr „Glu Glu Leu Thr Val lie Lys Ser Ser. 

245 250 ' 255 

ctg aaa gat gaa tat cag tac agt ttt tgg gat ctg ttt cgt tea ' aaa 816 

Leu Lys Asp« Glu Tyr Gin Tyr Ser Phe 'tfrp Asp Leu Phe Arg Ser Lys . 
260 265 ' ' 270 

gac aac atg egg ace cga ata atg ata gga eta aca eta gta ttt ttt 864 

Asp Asn Met Arg Thr Arg lie Met lie Gly Leu Thr, Leu Val Phe Phe 
275 280 ' " 285 

gta caa ate act ggc caa cca aac ata ttg ttc tat gca tea act gtt 912 

Val Gin Tie Thr .Gly Gin Pro Asn He Leu Phe Tyr Ala Ser Thr Val 
290 ' , , 295 300 

. i ' • 

ttg aag tea gtt gdfa ttt caa age aat gag gca gpt age etc gee tec 960 

Leu Lys Ser Val Gly Phe Gin Ser. Asn Glu Ala Ala Ser Leu Ala Ser 

305 310 ' 315 320 

act ggg gtt gga gtc gtc aag gtc att age ace ate cct get act ctt 1008 

Thr Gly Val Gly Val Val Lys Val lie Ser Thr lie Pro Ala Thr Leu 

325 330 335 

■ 

ctt 'gta gac cat gtc ggc age aaa aca ttc etc tgc' att ggc tec tct 1056 

Leu Val Asp Hi's Val -Gly Ser Lys Thr Phe Leu Cys.Ile Gly Ser Ser 
340 345 350 

gtg atg gca get teg ttg gtg acc atg ggc ate gta aat. etc aac ate 1104 

Val Met Ala Ala Ser Leu Val Thr Met Gly He Val Asn Leu Asn lie 
355 360 365 

cac atg aac ttc acc cat ate tgc aga age cac aat tct ate aac cag 1152 

His Met Asn Phe Thr His lie Cys Arg Ser His Asn Ser He Asn Gin 
370 375 380 

tec ttg. gat gag tct gtg att tat gga cca gga aac ctg tea acc aac 1200 

Ser Leu Asp Glu Ser Val He Tyr Gly Pro Gly Asn, Leu Ser Thr Asn 
385 390 395 ' 400 

aac aat act etc aga gac cac ttc aaa ggg att tct tec cat age aga 1248 

Asn Asn Thr Leu Arg Asp His Phe Lys Gly He Ser Ser His Ser Arg 

405 410 415 

age tea etc atg ccc ctg aga aat gat gtg gat aag aga ggg gag acg 1296 

Ser Ser Leu Met Pro Leu Arg Asn Asp Val Asp Lys Arg Gly Glu Thr 
420 425 430 

acc tea gca tec ttg eta aat get gga tta age cac act gaa tac cag 1344 

Thr Ser Ala Ser Leu Leu Asn Ala Gly Leu Ser His Thr Glu Tyr Gin 
435 440 445 

ata gtc aca gac cct ggg gac gtc cca get ttt ttg aaa tgg ctg tec 1392 

He Val Thr Asp Pro Gly Asp Val Pro Ala Phe Leu Lys Trp Leu Ser 
450 455 460 

tta gee age ttg ctt gtt tat gtt get get ttt tea att ggt eta gga 1440 

Leu Ala Ser Leu Leu Val Tyr Val Ala Ala Phe Ser He Gly Leu Gly 
465 470 475 ~ 480 



BNSDOCID: <WO_02055701A2_I_> 



WO 02/055701 



PCTAJS01/49060 



cca atg ccc tgg ctg gtg etc age gag ate ttt.cct.gg_t ggg ate aga 1488 
Pro Met Pro Trp Leu .Val Leu |( Ser Glu lie phe Pro Gly Gly He Arg . 

485 490 ' 495 , . 

gga cga gee atg get tta act tct age atg aac tgg ggc ate aat 1 etc 1536 
Gly Arg Ala .Met Ala Leu Thr Ser Ser Wet Asn Trp Gly lie Asn Leu . 
500 ' 505 "V -510 



etc ate teg ctg aca ttt ttg 
Leu lie Ser Leu Thr Phe Leu 
515 

i 

tgg gtg tgc ttt ata tat aca 
Trp Val Cys Phe .He Tyr Thr 

530 ' 535 



act gta act gat ctt att ggc ctg cca 
Thr Val Thr Asp Leu , He Gly Leu Pro 
520 ( 525 

i i t 

ate atg agt eta gca tec ctg ctt ttt 
He Met Ser Leu Ala Ser Leu Leu Phe 
540 



1584 



■ » 

1632 



gtt gtt atg ttt ata cct gag 
Val Val Met Phe He Pro Glu 
545 " 550 



aca aag gga tgc tct ttg gaa caaata 
Thr. Lys Gly Cys Ser Leu Glu Gin lie 
555 560 



1680 



tea atg gag' eta gca aaa gtg aac tat gtg aaa aac aac att tgt ttt 
Ser Met Glu Leu Ala Lys Val Asn Tyr Val Lys Asn Asn He Cys Phe 



565 



570 



57 5 



1728 



atg agt cat cac caa gaa gaa tta gtg cca* aaa cag'ect caa aaa. aga 
Met Ser His Hi's Gin -Glu Glu Leu Val Pro Lys Gin Pro Gin Lys Arg 
580 5S5 590 



1776 



aaa ccc cag gag cag etc ttg gag tgt aac aag ctg tgt ggt agg ggc 
Lys Pro Gin Glu Gin Leu Leu Glu Cys Asn Lys Leu Cys Gly Arg Gly 
595 600 605 



1824 



caa tec agg cag ctt tct cca gag ace taa 
Gin Ser Arg Gin Leu Ser Pro, Glu Thr * 
610 615 



1854 



<210> 4 

<211> 2230 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (376) . . . (1962) 



<400> 4 

gtcgacccac 

ageaagtgea 

cttcagcccg 

gtctactgag 

tcagggcaag 

ttctccagcc 

cggccccccg 



gcgtccggca acatggegge 
gcgggctcct accccgggtg 
etcctgtccc cgacatcacg 
aeggggagge gtgacagggc 
ctccccgtct ccgggcgcac 
tcctcccctc gcaggtggga 
ggacc atg gee ggg tec 
Met Ala Gly Ser 
1 



tgccgtggtg cagcgcccgg getgagegae 60 
aggggtggcc tccgcgtggg atcgtgccct 120 
tgtattcege acgtcccctc cgcgctgtgt 180 
ccgggtccct tctcagtggt gctctgtgct 240 
ttccctcgcc tgtgttcggt ccatcctcct 300 
tcgtcggtgg gaeeggageg egggegggeg 360 
gac acc gcg ccc ttc etc age cag 411 
Asp Thr Ala Pro Phe Leu Ser Gin 
5 10 



gcg gat gac ccg gac gac ggg cca gtg cct ggc acc ccg ggg ttg cca 
Ala Asp Asp Pro Asp Asp Gly Pro Val Pro Gly Thr Pro Gly Leu Pro 
15 20 25 



459 
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ggg tec acg ggg aac ccg aag tec gag gag ccc gag gtc ceggae cag .507 
Gly Ser Thr Gly Asn Pro Lys Ser Glu Glu Pro. Glu Val Pro Asp Gin 
30 35 ' 40 

gag ggg ctg cag cgc ate acc ggc ctg tct ccc gge cgt teg get etc 555 
Glu Gly Leu Gin Arg lie Thr Gly Leu Ser Pro Gly Arg Ser Ala Leu 
45 • 50 55 , 60 

ata gtg gcg gtg ctg tgc tac ate aat etc ctg aac tac atg gac.cgc 603 
lie Val Ala Val Leu Cys Tyr lie Asn Leu Leu Asn Tyr Met Asp Arg 

65 ' 70' 75 

» • ■ ' 

■ i » " 

ttc acc gtg get ggc gtc ctt ccc gac ate gag cag ttc ttc aac ate .651 
Phe Thr Val Ala Gly Val Leu Pro Asp lie Glu Gin Phe Phe Asn lie 
80 ,85 90 



tjgq gac agt age tct ggg etc ate cag acc gtg' ttc ate tec agt tac 
Gly Asp Ser Ser Ser Gly Leu lie Gin Thr Val Phe lie Ser Ser Tyr 
95 100 105 



699 



atg gtg ttg gca cct gtg- ft t ggc tac ctg ggt gac agg tac aat egg 747 
Met Val Leu Ala Pro Val Phe Gly Tyr Leu Gly Asp Arg Tyr Asn Arg 
110 115 120 

aag tat etc atg tgc ggg ggc att gee ttc tgg tec ctg gtg aca. ctg 795 
Lys Tyr Leu Met Cys Gly Gly lie Ala Phe Trp Ser Leu Val Thr Leu 
125 130 135 • 140 

ggg tea tec ttc ate ccc gga gag cat ttc tgg ctg etc etc ctg acc 84 3 
Gly Ser Ser Phe lie Pro Gly Glu His Phe Trp Leu Leu Leu Leu Thr 
14 5 150 155 

egg ggc ctg gtg ggg gtc ggg gag gee agt tat tec acc ate gcg ccc 891 
Arg Gly Leu Val Gly Val Gly Glu Ala Ser Tyr Ser Thr lie Ala Pro 
160 • 165 170 

act etc att gec gac etc ttt gtg gee gac cag egg age egg atg etc 939 
Thr Leu lie Ala Asp Leu Phe Val Ala Asp Gin Arg Ser Arg Met Leu 
175 180 185 

age ate ttc tac ttt gee att ccg gtg ggc agt ggt ctg ggc tac att 987 
Ser He Phe Tyr Phe Ala lie Pro Val Gly Ser Gly Leu Gly Tyr He 
190 195 200 

gca ggc "tec aaa gtg aag gat atg get gga gac tgg cac tgg get ctg 1035 
Ala Gly Ser Lys Val Lys Asp Met Ala Gly Asp Trp His Trp Ala Leu 
205 210 215 220 

agg gtg aca ccg ggt eta gga gtg gtg gee gtt ctg ctg ctg ttc ctg 108 3 
Arg Val Thr Pro Gly Leu Gly Val Val Ala Val Leu Leu Leu Phe Leu 
225 230 235 

. gta gtg egg gag ccg cca agg gga gee gtg gag cgc cac tea gat ttg 1131 
Val Val Arg Glu Pro Pro Arg Gly Ala Val Glu Arg His Ser Asp Leu 
240 245 250 

cca ccc ctg aac ccc acc teg tgg tgg gca gat ctg agg get ctg gca 1179 
Pro Pro Leu Asn Pro Thr Ser Trp Trp Ala Asp Leu Arg Ala Leu Ala 
255 260 265 
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aga aat cct agt ttc gtc ctg tct tec ctg ggc ttc act get - gtg gec 1227 
Arg Asn Pro Ser Phe Val Leu Ser Ser Leu . Gly, Phe Thr Ala Val Ala 
270 275 280 

ttt gtc acg ggc tec ctg get ctg tgg get ccg gca ttc ctg ctg cgt 1275 , 
Phe Val' Thr Gly Ser Leu Ala Leu Trp Ala Pro Ala Phe Leu Leu Arg 
285 290 295 ' 300 

tec cgc gtg gtc ctt ggg gag ace cca ccc tgc ctt ccc gga gac tec 1323 

Ser Arg Val Val Leu Gly Glu Thr Pro Pro Cys Leu Pro Gly Asp Ser 

305.. ' 310" . ; 315 

* * 

• i > . 

tgc tct tec tct gac agt etc ate ttt gga etc ate ace tgc ctg acc 1371. 

Cys Ser Ser Ser Asp Ser Leu lie Phe Gly Leu lie Thr Cys Leu Thr 

320 , 325 330 

gga gtc ctg ggt gtg ggc ctg ggt gtg gag ate' age cgc egg etc cgc 1419 
Gly Val Leu Gly Val Gly Leu fely Val Glu lie Ser Arg Arg Leu Arg 
335 * " 340 345 

i cac tec aac ccc egg get gat ccc ctg gtc tgt gee act ggc etc ctg 14 67 
His Ser Asn Pro Arg Ala Asp Pro Leu Val Cys Ala Thr Gly Leu Leu 
350 355 360 

ggc tct gca ccc ttc etc ttc ctg tec ctt gee tgc gee cgt ggt age 1515 
Gly Ser Ala Pro Phe Leu Phe Leu Ser Leu Ala Cys Ala Arg Gly Ser 
365 370 375 , 380 

ate gtg gee act tat att ttc ate ttc att gga gag acc etc ctg tec 1563 
lie Val Ala Thr Tyr lie Phe lie Phe lie Gly Glu Thr Leu Leu Ser 
385 390 395 

atg aac tgg gee ate gtg gee gac att ctg ctg tac gtg gtg ate cct 1611 
Met Asn Trp Ala lie Val Ala Asp lie Leu Leu Tyr Val Val lie Pro 
400 405 410 

acc cga cgc tec acc gee gag gee ttc cag ate gtg ctg tec cac ctg 1659 
Thr Arg Arg Ser Thr Ala Glu Ala Phe Gin He; Val Leu Ser His Leu 
415 420 425 

ctg ggt gat get ggg age ccc tac etc att ggc ctg ate tct gac cgc 1707 
Leu Gly Asp Ala Gly Ser Pro Tyr " Leu He Gly Leu He Ser Asp Arg 

430 435 440 f 

ctg cgc "egg aac tgg ccc ccc tec ttc ttg tec gag ttc egg get ctg 1755 
Leu Arg Arg Asn Trp Pro Pro Ser Phe Leu Ser Glu Phe Arg Ala Leu 
445 450 455 460 

cag ttc teg etc atg etc tgc gcg ttt gtt ggg gca ctg ggc ggc gca 1803 
Gin Phe Ser Leu Met Leu Cys Ala Phe Val Gly Ala Leu Gly Gly Ala 
4 65 470 475 

gee ttc ctg ggc acc gee ate ttc att gag gee gac cgc egg egg gca 1851 
Ala Phe Leu Gly Thr Ala He Phe He. Glu Ala Asp Arg Arg Arg Ala 
480 485 490 

cag ctg cac gtg cag ggc ctg ctg cac gaa gca ggg tec aca gac gac 1899 
Gin Leu His Val Gin Gly Leu Leu His Glu Ala Gly Ser Thr Asp Asp 
4 95 500 505 
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egg att gtg gtg ccc cag egg ggc cgc tec ace cgc gtg ccc gtg gee 1947 
Arg lie Val Val Pro Gin Arg Gly Arg Ser Thr Arg Val Pro Val Ala 
510 ►> Sis" ' 520 

agt gtg etc 3tc ttja dfaggcttjccg ctcacctacc tgcacatctg ccacagctigg *2002 
Ser Val Leu lie * . ,, 

525 ... 

ccctgggccc accccacgaa gggcctgggc ctaacccctt .ggcctggccc agcttccaga 2062 
gggaccctgg gccgtgtgcc agctcccaga cactacatgg gtagctcagg ggaggaggtg 2122 
ggggtccagg agggggatcc ctctccacag gggcagcccc aagdjgctcgg tgctatttgt 2182 
aacggaataa aatttgtagc cagaaaaaaa aaaaaaaagg' ( gcggccgc 2239 

<210> 5 

<211> 528 • 

<212> PRT 1 ' 

<213> Homo sapiens ■' 



<400> 5 ' 



Met 


Ala 


Gly 


Ser 


Asp 


Thr 


Ala Pro 


Phe 


Leu 


Ser 


Gin 


Ala 


Asp 


Asp 


Pro 


1 








5 








10 








i 


15 




Asp 


Asp 


Gly 


Pro 


Val 


Pro Gly Thr 


Pro 


Gly 


Leu 


Pro 


Gly 


Ser 


Thr 


Gly 








20 








25 










30 






Asri 


Pro 


Lys 


Ser Glu 


Glu 


Proi Glu 


Val 


Pro 


Asp 


Gin 


Glu 


Gly 


Leu 


Gin 






35 








40 










45 






i 


Arg 


lie 


Thr Gly Leu 


Ser 


Pro Gly. Arg 


Ser 


Ala 


Leu 


He 


Val 


Ala 


Val 




50 








55 








60 










Leu 


Cys 


Tyr 


lie 


Asn 


Leu 


Leu Asn 


Tyr 


Met Asp 


Arg 


Phe 


Thr 


Val 


Ala 


65 










70 








75 










80 


Gly 


Val 


Leu 


Pro 


Asp 


He 


Glu Gin 


fne 


Phe 


Asn 


lie 


Gly Asp 


Ser 


Ser 










85 








90 










95 




Ser 


Gly 


Leu 


lie 


Gin 


Thr 


Val Phe 


He 


Ser 


Ser 


Tyr 


Met 


Val 


Leu 


TV 1 

Ala 








100 








105 










110 






Pro 


val 


Phe 


Gly 


Tyr 


Leu 


Gly Asp 


Arg 


Tyr 


Asn 


Arg 


Lys 


Tyr 


Leu 


Met 






115 








120 










12 5 








Cys 


tsiy 


Gly 


lie 


Ala 


Phe 


Trp Ser 


Leu 


Val 


Thr 


Leu 


Gly 


Ser 


Ser 


.fne 




i ^n 










135 








140 










lie 


Pro 


Gly Glu 


His 


Phe 


Trp Leu 


Leu 


Leu 


Leu 


Thr 


Arg 


Gly 


Leu 


val 


14 5 










150 








155 










160 


Gly 


Val 


Gly Glu 


Ala 


Ser 


Tyr Ser 


Thr 


He 


Ala 


Pro 


Thr 


Leu 


He 


Ala 










165 








170 










17 5 




Asp 


Leu 


Phe 


Val 


Ala 


Asp Gin Arg 


Ser 


Arg 


Met 


Leu 


Ser 


He 


Phe 


Tyr 








180 








185 










190 






Phe 


Ala 


lie 


Pro 


Val 


Gly 


Ser Gly 


Leu 


Gly 


Tyr 


He 


Ala 


Gly 


Ser 


Lys 






195 








200 










205 








Val 


Lys 


Asp 


Met 


Ala 


Gly Asp Trp 


His 


Trp Ala 


Leu Arg 


Val 


Thr 


Pro 




210 










215 








220 










Gly 


Leu 


Gly Val 


Val 


Ala 


Val Leu 


Leu 


Leu 


Phe 


Leu 


Val 


Val 


Arg 


Glu 


225 










230 








235 










240 


Pro 


Pro 


Arg 


Gly Ala 


Val 


Glu Arg 


His 


Ser 


Asp 


Leu 


Pro 


Pro 


Leu 


Asn 










245 








250 










255 




Pro 


Thr 


Ser 


Trp 


Trp 


Ala 


Asp Leu 


Arg 


Ala 


Leu 


Ala 


Arg 


Asn 


Pro 


Ser 








260 








265 










270 






Phe 


Val 


Leu 


Ser 


Ser 


Leu 


Gly Phe 


Thr 


Ala 


Val 


Ala 


Phe 


Val 


Thr 


Gly 






275 








280 










285 








Ser 


Leu 


Ala 


Leu 


Trp 


Ala 


Pro Ala 


Phe 


Leu 


Leu 


Arg 


Ser 


Arg 


Val 


Val 




290 










295 








300 










Leu 


Gly 


Glu 


Thr 


Pro 


Pro 


Cys Leu 


Pro 


Gly Asp 


Ser 


Cys 


Ser 


Ser 


Ser 


305 










310 








315 










320 


Asp 


Ser 


Leu 


lie 


Phe 


Gly Leu He 


Thr 


Cys 


Leu 


Thr 


Gly Val 


Leu 


Gly 










325 








330 










335 
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Val 


Gly 


Leu 


Gly 


Val 


Glu 


He 


Ser 


Arg 


Arg 


Leu Arg 


His 


Ser 


Ash 


Pro 






340 




* * 

Val 




n 


345 


Gly 








350 




Pro 


Arg 


Ala 


Asp 


Pro 


Leu 


Cys 


Ala 


Thr 


Leu 


Leu 


Gly 


Ser 


Ala 






Jbb 










360 










-3^5 






r 

Thr 


Phe 


Leu 


Phe 


Leu 


Ser 


Leu 


Ala 


Cys 


Ala 


Arg 

» » 


Gly 


Ser 


lie 


Val Ala 




370 


i 








375 








380 










Tyr 


lie 


Phe 


He 


Phe 


He 


Gly 


Glu 


Thr 


Leu; Leu 


Ser 'Met 


Asn 


Trp 


Ala 


385 










390 










395 










4 00 


He 


Val 


Ala 


Asp 


lie 


Leu 


Leu 


Tyr 


Val 


Val 


He 


Pro 


Thr Arg Arg 


Ser 








405 










410 










415 




Thr 


Ala 


Glu 


Ala 


Phe 


Gin. 


He 


Val 


Leu 


Ser 


His 


Leu 


Leu 


Gly 


Asp 


Ala 








420 






i 




4 26 








430 






Gly 


Ser 


Pro 


Tyr 


Leu 


He 


Gly 


Leu 


lie 


Ser 


Asp 


Arg 


Leu 


Arg 


Arg 


Asn 




435 










440 










445 








Trp 


Pro 


Pro, 


Ser 


Phe 


,Leu 


Ser 


Glu 


Phe 


Arg 


Ala 


Leu 


Gin 


Phe 


Ser 


Leu 


450 








i 


455 


* 








4 60 










Met 


Leu 


Cys 


Ala 


Phe 


Val 


Gly Ala 


Leu 


Gly 


Gly 


Ala 


Ala 


Phe 


Leu 


Gly 


465 








470 










475 










480 


Thr 


Ala 


lie 


Phe 


lie 


Glu 


Ala 


Asp 


Arg Arg 


Arg 


Ala 


Gin 


Leu 


His 


Val 










485 










490 










495 




Gin 


Gly 


Leu' 


Leu 


His 


Glu 


Ala 


Gly 


Ser 


Thr 


Asp Asp Arg 


He 


Val 


Val 






500 










505 


• 








510 






Pro 


Gin 


Arg 
515 


Gly 


Arg 


Ser 


Thr 
• 


Arg 
520 


Val 


Pro 


Val 


Ala 


Ser 
525 


Val 


Leu 


He 



<210> 6 
<211> 1587 
<212> DNA 

<213> Homo sapiens 

<220> , 
<221> CDS 

<222> (1) . . . (1587) 
<400> 6 

atg gee ggg tec gac acc gcg ccc ttc etc age cag gcg gat gac ccg 48 
Met Ala Gly Ser Asp Thr Ala Pro Phe Leu Ser Gin, Ala Asp Asp Pro 
1 5 10 ■"' 15 

gac gac ggg cca gtg cct ggc acc ccg ggg ttg cca ggg tec acg ggg 96 
Asp Asp Gly Pro Val Pro Gly Thr Pro Gly Leu Pro Gly Ser Thr Gly 
20 25 30 

aac ccg aag tec gag gag ccc gag gtc ccg gac cag gag ggg ctg cag 144 
Asn Pro Lys Ser Glu Glu Pro Glu Val Pro Asp Gin Glu Gly Leu Gin 
35 4 0 4 5 

cgc ate acc ggc ctg tct ccc ggc cgt teg get etc ata gtg gcg gtg 192 
Arg lie Thr Gly Leu Ser Pro Gly Arg Ser Ala Leu He Val Ala Val 
50 55 60 

ctg tgc tac ate aat etc ctg aac tac atg gac cgc ttc acc gtg get 240 
Leu, Cys Tyr lie Asn Leu Leu Asn Tyr Met Asp Arg Phe Thr Val Ala 
65 70 75 80 

ggc gtc ctt ccc gac ate gag cag ttc ttc aac ate ggg gac agt age 288 
Gly Val Leu Pro Asp lie Glu Gin Phe Phe Asn He Gly Asp Ser Ser 
85 90 95 
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tct ggg etc ate cag acc gtg ttc ate tec agt tac atg gtg.ttg gca 336 

Ser Gly Leu lie Gin Thr Val Phe lie Ser Ser, Tyr Met Val Leu Ala 
100 . 105 110 

cct gtg ttt ggc tac ctg ggt gac agg tac aat* egg aa<} tat etc atg 384 

Pro Val* Phe Gly Tyr Leu Gly Asp Arg Tyr Asn Arg Lys Tyr Leu Met 

115 120 ' 125 

tgc ggg ggc att gee ttc tgg tec ctg gtg aca ctg ggg tea tec ttc 432 

Cys Gly Gly lie Ala Phe Trp Ser Leu Val Thr. Leu Gly Ser Ser Phe 
130 135 ' 140- 

j 1 • 

ate ccc gga gag cat ttc tgg ctg etc etc ctg acc egg ggc ctg gtg 480 

He Pro Gly Glu His Phe Trp Leu Leu Leu Leu Thr Arg Gly Leu Val 

145 150 ( 155 160 

ggg gtc ggg gag gec agt tat tec acc ate gcg'ccc act etc att gee 528 

Gl^ Val Gly Glu Ala Ser Tyr Ser Thr He Ala Pro Thr Leu He Ala 

" 165 170 . 17 5 

, gac etc ttt gtg gec gac cag egg age egg atg etc age ate ttc tac 576 

Asp Leu Phe Val Ala Asp Gin Arg Ser Arg Met Leu Ser He Phe Tyr 

180 185 190 . . „ • 

ttt gec att ccg gtg ggc agt ggt ctg ggc tac att gca ggc tec aaa 624 

Phe Ala He Pro Val Gly Ser Gly Leu Gly Tyr He Ala Gly Ser Lys 

195 200 2,05 

gtg aag gat atg get gga gac tgg cac tgg get ctg agg gtg aca ccg 672 

Val Lys Asp Met Ala Gly Asp Trp His Trp Ala Leu Arg Val Thr Pro 
210 215 220 

ggt eta gga gtg gtg gec gtt ctg ctg ctg ttc ctg gta gtg egg gag 720 

Gly Leu Gly Val Val Ala Val Leu Leu Leu Phe Leu Val Val Arg Glu 

225 230 235 240 

ccg cca agg gga gee gtg gag cgc cac tea gat ttg cca ccc ctg aac 7 68 

Pro Pro Arg Gly Ala Val Glu Arg His Ser Asp Leu Pro Pro Leu Asn 

245 250 255 

ccc acc teg tgg tgg. gca gat ctg agg get ctg gca aga aat cct agt 816 

Pro Thr Ser Trp Trp Ala Asp Leu Arg Ala Leu Ala Arg Asn Pro Ser 
260 265 270 

ttc gtc 'ctg tct tec ctg ggc ttc. act get gtg gec ttt gtc acg ggc 864 

Phe Val Leu Ser Ser Leu Gly Phe Thr Ala Val Ala Phe Val Thr Qly 

275 280 285 

tec ctg get ctg tgg get ccg gca ttc ctg ctg cgt tec cgc gtg gtc 912 

Ser Leu Ala Leu Trp Ala Pro Ala Phe Leu Leu Arg Ser Arg Val Val 
290 295 300 

ctt ggg gag acc cca ccc tgc ctt ccc gga gac tec tgc tct tec tct 960 

Leu Gly Glu Thr Pro Pro Cys Leu Pro Gly Asp Ser Cys Ser Ser Ser 

305 310 . . 315 320 

gac agt etc ate ttt gga etc ate acc tgc ctg acc gga gtc ctg ggt 1008 

Asp Ser Leu He Phe Gly Leu He Thr Cys Leu Thr Gly Val Leu Gly 

325 330 335 
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gtg ggc ctg ggt gtg gag ate age cgc egg etc cgc cac tec aac ccc 
Val Gly Leu Gly Val Glu lie Ser Arg Arg Leu Arg His Ser Asri Pro 
.340 . 345 350 



1056 



egg get gat ccc ctg gtc tgt gee act ggc etc ctg ggc tct gca ccc 
Arg Ala Asp Pro Leu Val Cys Ala Thr Gly Leu Leu Gly Ser Ala Pro 
355 360 ' 3^5 



1104 



ttc etc ttc ctg tec ctt gee tgc gee cgt . ggt age ate gtg gee act 
Phe Leu Phe Leu Ser Leu Ala Cys Ala Arg Gly Ser lie Val Ala Thr 
370 375 * - 380- : 



1152 



tat att ttc ate ttc att gga gag acc etc ctg tec atg aac tgg gec 
Tyr lie Phe lie Phe lie Gly Glu Thr Leu Leu Ser Met Asn Trp Ala 
385 390 , 395 400 



1200 



«atc gtg gec gac att ctg ctg tac gtg gtg ate' cct acc cga cgc tec 
lie Val Ala Asp lie Leu Leu Tyr Val Val lie Pro Thr Arg Arg Ser 
* 405 410 415 



1248 



aec gee gag gee ttc cag ate gtg ctg tec cac ctg ctg ggt gat get 
Thr Ala Glu Ala Phe Gin lie Val Leu Ser His Leu Leu Gly Asp Ala 
420 425 430 



1296 



ggg age ccc tac etc att ggc ctg ate tct gac cgc ctg cgc egg aac 
Gly Ser Pro Tyr Leu lie Gly Leu lie Ser Asp Arg Leu Arg Arg Asn 
4 35 .4 40 4,4 5 



1344 



tgg ccc ccc tec ttc ttg tec gag ttc egg get ctg cag ttc teg etc 
Trp Pro Pro Ser Phe Leu Ser Glu Phe Arg Ala Leu Gin Phe Ser Leu 
450 455 460 



1392 



atg etc tgc gcg ttt gtt ggg gca ctg ggc ggc gca gee ttc ctg ggc 
Met Leu Cys Ala Phe Val Gly Ala Leu Gly Gly Ala Ala Phe Leu Gly 
465 470 475 480 



1440 



acc gee ate ttc att gag gec gac cgc egg egg gca cag ctg cac gtg 
Thr Ala lie Phe lie Glu Ala Asp Arg Arg Arg Ala Gin Leu His Val 
485 490 495 



1488 



cag ggc ctg ctg cac gaa gca ggg tec aca gac gac egg att gtg gtg 
Gin Gly Leu Leu His Glu Ala Gly Ser Thr Asp Asp Arg lie Val Val 
500 505 510 



1536 



ccc cag egg ggc cgc tec acc cgc gtg ccc gtg gec agt gtg etc ate 
Pro Gin Arg Gly Arg Ser Thr Arg Val Pro Val Ala Ser Val Leu lie 
515 520 525 



1584 



tga 



1587 



<210> 7 

<211> 4632 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> (225) . 



. (3581) 
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<220> 

<221> misc_feature |fc 
<222> (4611) 
- < 22 3> n = a,c, t, or g 



<400> 7 • 

cacgcgtccg cccacgcgtc cgcccacgcg tccgagcccc cttteaagcc ttagcttccg 60 
gctccaagcc gaccccctcc ccctccctgt ccccttcccc ttctcccatc cctctctcgg 120 
ccacagcgtc ttgttagtcc tctccctcta ctccgcaata ttttctttct ttctccctcc 180 
tctcctccat ttgttgtttg atgtttccca ctctttgagg aagg atg gtt gat ttg 236 

Met Val Asp Leu 



gag age gaa gtg ccc cct ctg cct ccc agg tac agg ttt cga gat ttg 
Glu Ser Glu , Val Pro pro Leu Pro Pro Arg Tyr Arg Phe Arg Asp Leu 
5 , 10 . 15 20 



284. 



ctg eta ggg gac caa gga tgg caa aac gat gac aga 
Leu Leu Gly Asp* Gin Gly Trp Gin Asn Asp Asp Arg 

25 30 . • 

ttc tat atg aat gaa, aat aca ttt aaa gaa aga eta 
Phe Tyr Met Asn Glu Asn Thr Phe Lys Glu Arg Leu 
40 . * 45 



gta caa gtt gaa 332 
Val Gin Val Glu 
35 

i 

aaa tta ttt ttc 380 
Lys Leu Phe Phe 

50. 



ata aaa aac cag aga tea agt eta agg ata cgc ctg 
lie Lys Asn Gin Arg Ser Ser Leu Arg lie Arg Leu 
55 60 



ttc aat ttt tct 
Phe Asn Phe Ser 
65 



428 



etc aaa tta eta age tgc tta tta tac ata ate cga gta eta eta gaa 
Leu Lys Leu Leu Ser Cys Leu Leu Tyr lie lie Arg Val Leu Lsu Glu 
70 75 '' 80 



476 



aac cct tea caa gga aat gaa tgg tct cat ate ttt tgg gtg aac aga 
Asn Pro Ser Gin Gly Asn Glu Trp Ser His lie Phe Trp Val Asn Arg 
85 90 95 1-00 



524 



agt eta cct ttg tgg ggc tta cag gtt tea gtg gca, ttg ata agt ctg 
Ser Leu Pro Leu Trp Gly Leu Gin Val Ser Val Ala Ldu lie Ser Leu 
105 110 115 



572 



ttt gaa aca ata tta ctt ggt tat ctt agt tat aag gga aac ate tgg 
Phe Glu Thr lie Leu Leu Gly Tyr Leu Ser Tyr Lys Gly Asn lie Trp 
120 125 130 



620 



gaa cag att tta cga ata ccc ttc ate ttg gaa ata att aat gca gtt 
Glu Gin lie Leu Arg lie Pro Phe lie Leu Glu lie lie Asn Ala Val 
135 14 0 14 5 



668 



ccc ttc att ate tea ata ttc tgg cct tec tta agg aat eta ttt gtc 
Pro Phe He He Ser He Phe Trp Pro Ser Leu Arg Asn Leu Phe Val 
150 155 160 



716 



cca gtc ttt ctg aac tgt tgg ctt gec aaa cat gec ttg gaa aat atg 
Pro Val Phe Leu Asn Cys Trp Leu Ala Lys His Ala Leu Glu Asn Met 
165 170 175 . 180 



7 64 



att aat gat eta cac aga gee att cag cgt aca cag tct gca atg ttt 
He Asn Asp Leu His Arg Ala lie Gin Arg Thr Gin Ser Ala Met Phe 
185 190 195 



812 
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aat caa gtt ttg att tta ata tct aca tta eta tgc. ctt ate ttc ace 860, 
Asn Gin Val Leu lie Leu lie M Ser Thr Leu ^eu Cys Leu lie Phe Thr . 
200 1 ' 205 ' * 210 

• l : . , r ■ 

tgc att tgt ggg ate caa cat ctg gaacga ata gga aag agg ctg aat 908 

Cys lie Cys i Gly lie Gin His Leu Glu ftrg lie Gly Lys Arg Leu Asn 

215 .220 225 

etc ttt gac tec ctt tat ttc tgc att gtg acg ttt tct act gtg ggc 956 

Leu Phe Asp Ser Leu Tyr Phe Cys lie Val Thr Phe, Ser Thr Val Gly 

230 • 235 240 

' . , . \ ., 

ttc ggg gat gtc act cct gaa aca tgg tec tec aag ctt ttt gta gtt 1004 

Phe Gly Asp Val .Thr Pro Glu Thr Trp Ser Ser Lys Leu Phe Val Val 

245 , 250 .255 . . 260 

get atg att tgt gtt get ctt gtg gtt eta ccc ata cag ttt gaa cag 1052 
Ala Met lie Cys Val Ala Leu Val. Val Leu Pro lie Gin Phe Glu Gin 
* 265 -270 - 275 

ctg get tat* ttg tgg atg gag aga caa aag tea gga gga aac tat. agt HOC 
Leu Ala Tyr Leu Trp Met Glu Arg Gin Lys Ser Gly Gly Asn Tyr Ser 

280 . ■ 285 290 

i * - . 

cga cat aga get caa act gaa aag cat gtc gtc ctg" tgt gtc age tea" 1148 

Arg His Arg Al'a Gin Thr Glu Lys His Val Val Leu Cys Val Ser Ser 
295 300 , 305 ' 

ctg aag att gat tta ctt atg gat ttt tta aat gaa ttc tat get cat 1196 
Leu Lys He Asp Leu Leu Met Asp Phe Leu Asn Glu Phe Tyr Ala His 
310 315 320 

cct agg etc cag gat tat tat gtg gtg att ttg tgt 'cct act gaa atg 1244 
Pro Arg Leu Gin Asp Tyr Tyr Val Val He Leu Cys Pro Thr Glu Met 
325 330 335 340 

gat gta cag gtt cga agg gta ctg cag att cca atg tgg tec caa cga 1292 
Asp Val Gin Val Arg Arg Val Leu Gin lie Pro Met Trp Ser Gin Arg 
34 5 350 ' 355 

gtt ate tac ctt caa ggt tea gee ctt aaa gat caa gac eta ttg aga 134 0 
Val He Tyr Leu Gin Giy Ser Ala Leu Lys Asp Gin Asp Leu Leu Arg 
360 365 370 

gca aag atg gat gac get gag gee tgt ttt att etc agt age cgt tgt 1388 
Ala Lys Met Asp Asp Ala Glu Ala Cys Phe lie Leu Ser Ser Arg Cys 
375 . 380 385 

gaa. gtg gat agg aca tea tct gat cac caa aca att ttg aga gca tgg 1436 
Glu Val Asp Arg Thr Ser Ser Asp His Gin Thr lie Leu Arg Ala Trp 
390 395 400 

get gtg aaa gat ttt get cca aat tgt cct ttg tat gtc cag ata tta 1484 
Ala Val Lys Asp Phe Ala Pro Asn Cys Pro Leu Tyr Val Gin lie Leu 
405 410 415 420 

aag cct gaa aat aaa ttt cac ate aaa ttt get gat cat gtt gtt tgt 1532 
Lys Pro Glu Asn Lys Phe His He Lys Phe Ala Asp His Val Val Cys 
425 430 435 
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gaa gaa gag ttt aaa tac gcc atg tta get tta aac tgt ata . tgc cca .1580 

Glu Glu Glu Phe Lys Tyr Ala Met Leu Ala Leu, As n Cys lie Cys Pro 
440 . 445 450 

gca aca tct aca ctt att aca eta ctg gtt cat acq tct aga ggg caa 1628 , 

Ala Thr' Ser Thr Leu lie Thr Leu Leu Val His. Thr Ser Arg Gly Gin ' 
455 460 / 4 65 

gaa ggc cag caa teg cca gaa caa tgg cag aag atg tac ggt aga tgc 1676 

Glu Gly Gin Gin Ser Pro Glu Gin Tro Gin Lys Met * Tyr Gly Arg Cys 

470 . 475 ' ' 480- . ; * 

■ i 1 

tec ggg aat gaa gtc tac cac att gtt ttg gaa gaa agt aca ttt ttt 1724. 

Ser Gly Asn Glu Val Tyr His lie Val Leu Glu Glu Ser Thr Phe Phe 

485 490 , 495 . 500 

get gaa tat gaa gga aag agt ttt aca tat gcc' tct ttc cat gca cac 1772 

Ala Glu Tyr Glu Gly Lys Ser the Thr Tyr Ala Ser Phe His Ala His 
* 505 510 515 

•aaa aag ttt ggc gtc tgc ttg att ggt gtt agg.agg gag gat aat aaa 1820 

Lys Lys Phe Gly Val Cys Leu lie Gly Val Arg Arg Glu Asp Asn Lys 
520 525 530 

aac att ttg ctg aat cca ggt cct cga tac att atg aat tct acg gac 1868 

Asn lie Leu Leu Asn Pro Gly Pro Arg Tyr lie Met Asn Ser Thr Asp 
535 540 5,4 5 

ata tgc ttt tat att aat att ace aaa gaa gag aat . tea gca ttt aaa 1916 

lie Cys Phe Tyr He Asn He Thr Lys Glu Glu Asn Ser Ala Phe Lys 
550 . 555 560 

aac caa gac cag cag aga aaa age aat gtg tec agg teg ttt tat cat 1964 

Asn Gin Asp Gin Gin Arg Lys Ser Asn Val Ser Arg Ser Phe Tyr His 

565 570 575 580 



gga cct tec aga tta cct gta cat age ata att gcc age atg ggt act 
Gly Pro Ser Arg Leu Pro Val His Ser He lie Ala Ser Met Gly Thr 
585 590 -. 595 



2012 



gtg get ata gac ctg caa gat aca age tgt aga tea gca agt ggc cct 2060 

Val Ala He Asp Leu Gin Asp Thr Ser Cys Arg Ser Ala Ser Gly Pro 

600 605 610 

ace ctg 'tct ctt cct aca gag gga age aaa gaa ata aga aga cct age 2108 

Thr Leu Ser Leu Pro Thr Glu Gly Ser Lys Glu He Arg Arg Pro Ser 

615 620 " 625 

att get cct gtt tta gag gtt gca gat aca tea teg att caa aca tgt 2156 

lie Ala Pro Val Leu Glu Val Ala Asp Thr Ser Ser lie Gin Thr Cys 
630 635 640 

gat ctt eta agt gac caa tea gaa gat gaa act aca cca gat gaa gaa 2204 

Asp Leu Leu Ser Asp Gin Ser Glu Asp Glu Thr Thr Pro Asp Glu Glu 

645 650 . 655 660 

atg tct tea aac tta gag tat get aaa ggt tac cca cct tat tct cca 2252 

Met Ser Ser Asn Leu Glu Tyr Ala Lys Gly Tyr Pro Pro Tyr Ser Pro 
665 670 675 
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tat ata gga agt tea ccc act ttt tgt cat etc ctt cat gaa .aaa gta 2300 
Tyr He Gly Ser Ser Pro Thr Phe Cys His Leu.Leu His Glu Lys Val 
680 . 685 690 



cca ttt tgc tgc tta aga tta gac aag agt tgc caa: cat aac tac tat 
Pro Phe* Cys Cys Leu Arg Leu Asp Lys Ser Cys. Gin His Asn Tyr Tyr 
695 700 ' 705 



2348 



gag gat gca aaa gee tat gga ttc aaa aat aaa eta att ata gtt gca 2396 

Glu Asp Ala Lys Ala Tyr Gly Phe Lys Asn Lys Leu He He Val Ala 

710 715 ' ■ * . 720 ; ' 

** | i 

get gaa aca get gga aat gga tta tat aac ttt att gtt cct etc agg 2444 

Ala Glu Thr Ala Gly Asn Gly Leu Tyr Asn Phe He Val Pro Leu Arg 

725 730 ( 735 ' 740 

gca tat tat aga cca aag aaa gaa ctt aat ccc 'ata gta ctg eta ttg 2492 

Ala Tyr Tyr Arg Pro Lys Lys tSlu Leu Asn Pro He Val Leu Leu Leu 

* 745 750 755 

i'gat aac ccc eta gat gac tta etc agg tgt gga gtg act ttt get get 2540 

Asp Asn Pro Leu Asp Asp Leu Leu Arg. Cys Gly Val Thr Phe Ala Ala 

760 765 770 

aat atg gtg gtt gtg gat aaa gag . age ace atg agt gee gag gaa gac 2588 

Asn Met Val Val Val Asp Lys Glu Ser Thr Met Ser Ala Glu Glu Asp 
775 780 7§5 

tac atg gca gat gee aaa ace att gtg aac gtg cag aca etc ttc agg 2636 

Tyr Met Ala Asp Ala Lys Thr He Val Asn Val Gin Thr Leu Phe Arg 
7 90 7 95 800 

ttg ttt tec agt etc agt att ate aca gag eta act cac ccc gee aac 2684 

Leu Phe Ser Ser Leu Ser He He Thr Glu Leu Thr His Pro Ala Asn 

805 810 815 820 

atg aga ttc atg caa ttc aga gee aaa gac tgt tac tct ctt get ctt 2732 

Met Arg Phe Met Gin Phe Arg Ala Lys Asp Cys Tyr Ser Leu Ala Leu ~" 

82.5 830 835 

tea aaa ctg gaa aag aaa gaa egg gag aga ggc tct aac ttg gee ttt 2780 

Ser Lys Leu Glu Lys Lys Glu Arg Glu Arg Gly Ser Asn Leu Ala Phe 

840 845 850 

atg ttt ega ctg cct ttt get get ggg agg gtg ttt age ate agt atg 2828 

Met Phe Arg Leu Pro Phe Ala Ala Gly Arg Val Phe Ser He Ser Met 
855 860 865 

ttg. gac act ctg ctg tat cag tea ttt gtg aag gat tat atg att tct 2876 

Leu Asp Thr Leu Leu Tyr Gin Ser Phe Val Lys Asp Tyr Met He Ser 
870 875 880 

ate acg aga ctt ctg ttg gga ctg gac act aca cca gga tct ggg ttt 2924 
He Thr Arg Leu Leu Leu Gly Leu Asp Thr Thr Pro Gly Ser Gly Phe 

885 890 895 900 

ctt tgt tct atg aaa ate act gca gat gac tta tgg ate aga act tat 2972 
Leu Cys Ser Met Lys lie Thr Ala Asp Asp Leu Trp He Arg Thr Tyr 

905 910 915 
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gcc aga ctt tat cag aag ttg tgt tct tct act gga gat gtt ccc att 
Ala Arg Leu Tyr Gin Lys j^eu Cys Ser Ser Thr Gly Asp Val Pro lie 
920 M ' ' ' Q ™ 



925 



gga ate tac agg act gag tct cag aaa ctt act a : ca. tct gag tct » dga 
Gly lie Tyr, Arg Thr Glu Ser Gin Lys 'Leu Thr Thr Ser Glu Ser Arg 
935 940 -945 



3020 



'3068 



aaa ata 
Lys lie 
950 

gac acc 
Asp Thr 
965 



gca tea caa tct caa ata tct ate agt gta gaa gag tgg gaa 3116 

Ala Ser Gin Ser Gin He Ser He Ser Val Glu Glu Trp Glu 

955 960 ' ■ 

aaa gac tec aaa gaa caa ggg cac cac cgc age aac cac cgc 3164 

Lys Asp ,Ser Lys Glu Gin Gly His His Arg Ser Asn His Arg 
970 975 980 



aac tea aca tec agt gac cag teg gac cat ccc ttg ctg egg aga aaa 
Ash Ser Thr Ser Ser Asp Gin Ser Asp His Pro Leu Leu Arg Arg Lys 
* 985 . 990 995 



3212 



age atg cag' tgg gcc cga aga ctg age aga aaa ggc cca aaa cac tct 
Ser Met Gin Trp Ala Arg Arg Leu Seir Aorg Lys Gly Pro Lys His Ser 
1000 1005 1010 



3260 



ggt aaa aca get gaa aaa ata acc cag cag cga ctg- aac etc tac agg 
Gly Lys Thr Ala Glu ,Lys lie Thr Gin Gin Arg Leu Asn Leu Tyr Arg 
1015 1020 1025' 



3308 



agg tea gaa aga caa gag ctt get gaa ctt gtg aaa aat aga atg aaa 
Arg Ser Glu Arg Gin Glu Leu Ala Glu Leu Val Lys Asn Arg Met Lys 
1030 1035 1040 



3356 



cac ttg ggt ctt tct aca gtg gga tat gat gaa atg 'aat gat cat caa 
His Leu Gly Leu Ser Thr Val Gly Tyr Asp Glu Met Asn Asp His Gin 
1045 1050 1055 1060 



3404 



agt acc etc tec tac ate ctg att aac cca tct cca gat acc aga ata 
Ser Thr Leu Ser Tyr lie Leu He Asn Pro Ser Pro Asp Thr Arg He 
1065 1070 ' ■". 1075 



3452 



gag ctg aat gat gtt gta tac tta att cga cca gat cca ctg gcc tac 3500 
Glu Leu Asn Asp Val Val Tyr Leu He Arg Pro Asp Pro Leu Ala Tyr 
1080 1085 1090 



ctt cca aac agt gag ccc agt cga aga aac age ate tgc aat gtc act 
Leu Pro Asn Ser Glu Pro Ser Arg Arg Asn Ser lie Cys Asn Val Thr 
1095 1100 1105 



3548 



ggt caa gat tct egg gag gaa act caa ctt tga taaaaataaa atgagaaact 3601 
Gly Gin Asp Ser Arg Glu Glu Thr Gin Leu * 
1110 1115 



tttttcctac 
atggaaatat 
atcattttga 
tcaatggaat 
tgttttataa 
gttgacaaaa 
ttaaaaatta 
ategtaaatg 
tgtgtaagaa 



aaagaccttg 
atgtaattct 
aacttaatgt 
aaatttgaaa 
taatccaaac 
tctaactata 
aatgtgtcat 
tctagtcttc 
tcatagtttg 



cttgaaacca 
ctcatattta 
actacttact 
agctaaatta 
tctatgaaag 
tttggtgcat 
attatattct 
tcacatagag 
ctttagaata 



caaaagtttt 
aaaacgtaat 
ggtactctcc 
aaatacaaaa 
caattttaaa 
cacaatggac 
ttaaacttac 
attaaccaac 
caaatcttta 



gctggcacga 
etcttctctt 
ctattaatat 
atttaaatct 
aattattaag 
acagaatget 
tgttttacaa 
aaacttgtgt 
agtcatttta 



aagaaactag 
agaagtatag 
ttgaaggacc 
gacatttaat 
gttttatgaa 
gctgctcctc 
aattgagctc 
ggctgacttt 
actttttttt 



3661 
3721 
3781 
3841 
3901 
3961 
4021 
4081 
4141 
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ctgccttacg atataaaaat atttatctta gaatttgaga tgttcatagc atgttttatt . 4 201 
acattgaaga aactaaaaca taaatgaaaa gaaacactag- gttcctgcac tttttggtaa 4261 
ctttat gtct agcaaatatt ttat gccaag aaaagcatac tataaagcaa atatctatta 4 321 
ttctcctaaa cgaatgccta gcatagagaa, aatacttaat, acacatttg.t tgacttaaat. 4*3"8T 
ttaattcaag gattgaaaaa ttaactggat atcttgaaat atacagtaat gattgtcctt 44 41 
agactcttga aetttaccat ctttcctatt catatatcta tajtagtaaat ttcactagaa 4 501 
aaattctttt aaaattgaca gaagataatt tatacctttt atggactctg aagacacttc 4561 
aaaacattaa aagtccttat gtctttggta atgaaacata cactcaatga ngatgtatta 4 621 
aattttgact t . 4 632 

• » . ■ 

<210> 8 , 

<211> 1.118 . » ■ 

<212> PRT 

<213> Homo sapiens * 

i 

<400> 8 

ttfet Val Asp Leu Glu Ser Glu Val Pro Pro Leu Pro Pro Arg Tyr Airg 

1 5 10 15 

Phe Arg Asp Leu Leu Leu Gly Asp Gin Gly Trp . Gin Asn Asp Asp Arg 
20 25 30 

1 Val Gin Val Glu Phe Tyr Met Asn Glu Asn Thr Phe Lys Glu Arg Leu 
35 40 45 

Lys Leu Phe Phe . He Lys Asn Gin Arg Ser Ser Leu Arg lie Arg Leu 

50 55 1 60 ' 

Phe Asn Phe Ser Leu Lys Leu Leu Ser Cys Leu Leu Tyr lie lie Arg 
65 70 75 80 

Val Leu Leu Glu Ash Pro Ser Gin Gly Asn Glu Trp Ser His He Phe' 

85 90 95 

Trp Val Asn Arg Ser Leu Pro Leu Trp Gly Leu Gin Val Ser Val Ala 

100 105 110 

Leu He Ser Leu Phe Glu Thr He Leu Leu Gly Tyr Leu Ser Tyr Lys 

115 120 125 

Gly Asn He Trp Glu Gin lie Leu Arg lie Pro Phe He Leu Glu lie 

130 135 140 

lie .Asn Ala Val Pro Phe He He Ser He Phe Trp Pro Ser Leu Arg 
145 150 155 160 

Asn Leu Phe Val Pro Val Phe Leu Asn Cys Trp Leu Ala Lys His Ala 

165 170 175 

Leu Glu Asn Met He Asn Asp Leu His Arg Ala He Gin Arg Thr Gin 

180 185 190 

Ser Ala Met Phe Asn Gin Val Leu lie Leu He Ser Thr Leu Leu Cys 
.195 200 205 

Leu He Phe Thr Cys He Cys Gly He Gin His Leu Glu Arg lie Gly 

210 215 - 220 

Lys Arg Leu Asn Leu Phe Asp Ser Leu Tyr Phe Cys He Val Thr Phe 
225 230 235 240 

Ser Thr Val Gly Phe Gly Asp Val Thr Pro Glu Thr Trp Ser Ser Lys 

245 250 255 

Leu Phe Val Val Ala Met He Cys Val Ala Leu Val . Val Leu Pro He 

260 265 270 

Gin Phe Glu Gin Leu Ala Tyr Leu Trp Met Glu Arg Gin Lys Ser Gly 

275 280 285 

Gly Asn Tyr Ser Arg His Arg Ala Gin Thr Glu Lys His Val Val Leu 

290 2 95 300 

Cys Val Ser Ser Leu Lys lie Asp Leu Leu Met Asp Phe Leu Asn Glu 
305 310 315 320 

Phe Tyr Ala His Pro Arg Leu Gin Asp Tyr Tyr Val Val He Leu Cys 

325 ~ 330 335 

Pro Thr Glu Met Asp Val Gin Val Arg Arg Val Leu Gin He Pro Met 
340 ' 345 350 
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Trp Ser Gin Arg Val lie Tyr Leu Gin Gly Ser Ala Leu Lys Asp Gin 

355 m M '360 • ' , 365 ' 

Asp Leu Leu Arg Ala Lys Met Asp Asp Ala Glu Ala Cys Phe lie Leu 

— — TT5 : ~~ r 57$ " : —mr— , r 

Ser Ser Arg Cys Glu Val Asp Arg Thr Ser Ser Asp" His Gin Thr lie 
385 . ' ■ ' 390 395 ./ " 400 

Leu Arg Ala Trp Ala Val Lys Asp Phe Ala Pro Asn Cys Pro Leu Tyr 

405 410 415 

Val Gin lie Leu Lys Pro Glu Asn Lys Phe His lie Lys Phe Ala Asp 

420 ;425 • 430 

His Val Val Cys Glu Glu. Glu Phe Lys Tyr Ala > Met Leu Ala Leu Asn 

435 i 440 ' ' 445 

Cys IleCys Pro Ala Thr Ser Thr Leu lie Thr Leu Leu Val His Thr 

450 - 455 460 

Ser Arg Gly 'Gin Glu Gly Gin Gin Ser Pro Glu Gin Trp Gin Lys Met 
465 . ( '470 . 1 475 480 

Tyr Gly Arg Cys Ser Gly Asn Glu Val Tyr His lie Val Leu Glu Glu 

• . 485 4 90 495 

Ser Thr Phe Phe* Ala Glu Tyr Glu Gly Lys Ser Phe Thr Tyr Ala Ser 

. 500 505 ■ ' 510 

Phe His Ala His Lys Lys Phe Gly Val Cys Leu lie Gly Vai Arg Arg 

515 520 525 

Glu Asp Asn Lys Asn lie Leu ( Leu Asn Pro Gly Pro Arg Tyr lie Met 

530 535 , 540 , 

Asn Ser Thr Asp lie Cys Phe Tyr lie Asn lie Thr Lys Glu Glu Asn 
545 '550 555 , 560 

Ser Ala Phe Lys Asn Gin Asp Gin Gin Arg Lys Ser Asn Val Ser Arg 

565 570 575 

Ser Phe Tyr His Gly Pro Ser Arg Leu Pro Val His Ser lie lie Ala 

580 585 590 

Ser Met Gly Thr Val Ala lie Asp Leu Gin Asp Thr Ser Cys Arg Ser 

595 ' 600 . .605 

Ala Ser Gly Pro Thr Leu Ser Leu Pro Thr Glu Gly Ser Lys Glu lie 

610 615 620 ; 

Arg Arg Pro Ser lie Ala Pro Val Leu Glu Val Ala Asp Thr Ser Ser 
625 630 635 640 

lie Gin Thr Cys Asp Leu Leu Ser Asp Gin Ser Glu Asp Glu Thr Thr 

645 650 • 655 

Pro Asp Glu Glu Met Ser Ser Asn ,Leu Glu Tyr Ala Lys Gly Tyr Pro 

660 665 670 

Pro Tyr Ser Pro Tyr lie Gly Ser Ser Pro Thr Phe Cys His Leu Leu 

675 .' 680 685 

His Glu Lys Val Pro Phe Cys Cys Leu Arg Leu Asp Lys Ser Cys Gin 

690 695 , . 700 

His Asn Tyr Tyr Glu Asp Ala Lys Ala Tyr Gly Phe Lys Asn Lys Leu 
705 " 710 715 720 

lie lie Val Ala Ala Glu Thr Ala Gly Asn Gly Leu Tyr Asn Phe lie 

725 730 735 

Val Pro Leu Arg Ala Tyr Tyr Arg Pro Lys Lys Glu Leu Asn Pro lie 

740 745 750 

Val Leu Leu Leu Asp Asn Pro Leu Asp Asp Leu Leu Arg Cys Gly Val 

755 760 765 

Thr Phe. Ala Ala Asn Met Val Val Val Asp Lys Glu Ser Thr Met Ser 

770 775 780 

Ala Glu Glu Asp Tyr Met Ala Asp Ala Lys Thr lie Val Asn Val Gin 
785 790 795 800 

Thr Leu Phe Arg Leu Phe Ser Ser Leu Ser lie lie Thr Glu Leu Thr 

805 810 815 

His Pro Ala Asn Met Arg Phe Met Gin Phe Arg Ala Lys Asp Cys Tyr 
820 825 830 
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Ser Leu 


Ala 


Leu 


Ser 




Leu 


Glu 


Lys 


Lys 


Glu 


Arg 


Glu 


Arg Gly Ser 






835 






i » 




.840 










845 








Asn 


Leu. 


Ala 


Phe 


Met 


Phe 


Arg 


Leu 


Pro Phe Ala Ala 


Gly Arg 


Val 


Phe 




850 






• 


• 


65£ 










8,60 








, r 


Ser 


lie 


Ser 


Met 


Leu 


Asp 


Thr 


Leu 


Leu 


Tyr 


Gin 


Ser 


Phe 


Val 


Lys Asp 


865 










870 








875 










880 


Tyr 


Met 


He 


Ser 


lie 


Thr 


Arg 


Leu 


Leu 


Leu 


Gly 


Leu 


'Asp 


Thr 


Thr 


Pro 








885 










890 










895 




Gly 


Ser 


Gly 


Phe 


Leu 


Cys 


Ser 


Met 


Lys 


He 


Thr 


Ala 


Asp 


Asp 


Leu 


Trp 






900 










905 










910 






lie 


Arg 


Thr Tyr 


Ala 


Arg. 


Leu 


Tyr 


Gin 


Lys 


Leu , Cys 


Ser 


Ser 


Thr 


Gly 






915 






i 


920 


i 






• 


925 








Asp 


Val 
930 


Pro 


He 


Gly 


He 


Tyr 
935 


Arg 


Thr 


Glu 


Ser 


Gin 
940 


Lys 


Leu 


Thr 


Thr 


Ser 


Glu 


Ser, 


Arg 


Lys 


lie 


Ala 


Ser 


Gin 


Ser 


Gin 


He 


Ser 


lie 


Ser 


Val 


945 






,950 




» 






955 










960 


Glu 


Glu 


Trp 


Glu 


Asp Thr Lys Asp 


Ser 


Lys 


Glu 


Gin 


Gly 


His 


His 


Arg 










.965 










970 










975 




Ser 


Asn 


His 


Arg 
980 


Asn 


Ser 


Thr 


Ser 


Ser 
'985 


Asp 


Gin 


Ser 


Asp 


His 
990 


Pro 


Leu 


Leu 


Arg 


Arg Lys 


Ser 


Met 


Gin 


Trp 


Ala 


Arg 


Arg 


Leu 


Ser 


Art( Lys 


Gly 






995 










1000 


» 






1005 






Pro 


Lys 


His 


Ser 


Gly Lys 


Thr ^ia 


Glu 


Lys 


He 


Thr 


Gin 


Gin 


Arg 


Leu 




1010 








1015 








1020 


i 






Asn 


Leu 


Tyr 


Arg 


Arg 


Ser 


Glu 


Arg 


Gin 


Glu 


Leu 


Ala 


'Glu 


Leu 


Val 


Lysf 


1025 








'1030 








1035 








1041 


Asn 


Arg 


Met 


Lys 


His 


Leu 


Gly 


Leu 


Ser 


Thr 


Val 


Gly 


Tyr 


Asp 


Glu 


Met 










1045 








1050 








1055 


Asn Asp 


His 


Gin 


Ser 


Thr 


Leu 


Ser 


Tyr 


lie 


Leu 


He 


Asn 


Pro 


Ser 


Pro 








1060 








1065 








1070 




Asp 


Thr 


Arg 


He 


Glu 


Leu 


Asn 


Asp 


Val 


Val 


Tyr 


Leu 


lie 


Arg 


Pro 


Asp 






1075 








1080 








108 5 






Pro 


Leu 


Ala 


Tyr 


Leu 


Pro 


Asn 


Ser 


Glu 


Pro 


Ser 


Arg 


Arg 


Asn 


Ser 


He 



1090 1095 1100 

Cys Asn Val Thr Gly Gin Asp Ser Arg Glu Glu Thr Gin Leu 
1105 1110 1115 



<210> 9 • 

<211> 3357 . - 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> CDS • 
<222> (1) . . '. (3357) 

<400> 9 

atg gtt gat ttg gag age gaa gtg ccc cct ctg cct ccc agg tac agg 48 

Met Val Asp Leu Glu Ser Glu Val Pro Pro Leu Pro Pro Arg Tyr Arg 

1 5 10 15 

ttt ega gat ttg ctg eta ggg gac caa gga tgg caa aac gat gac aga 96 
Phe Arg Asp Leu Leu Leu Gly Asp Gin Gly Trp Gin Asn Asp Asp Arg 
20 25 30 

ttt aaa gaa aga eta 14 4 
Phe Lys Glu Arg Leu 
45 
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gta caa gtt gaa ttc tat atg aat gaa aat aca 
Val Gin Val Glu Phe Tyr Met Asn Glu Asn Thr 
35 40 
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i 

aaa tta ttt ttc ata aaa aac cag aga tea agt eta agg ata cgc ctg 192 

Lys Leu Phe Phe lie Lys Asn Gin Arg Ser Ser Leu Arg lie Arg Leu 
50 55 60 

ttc a at ttt tct etc aaa tta eta age tgc tta 1 tta tac ata ate cga 240 

Phe Asn' Phe Ser Leu Lys Leu Leu Ser Cys Leu Leu Tyr He He Arg 
65 .70 75 ' 80* 

gta eta eta gaa aac cct tea caa gga aat gaa tgg tct cat ate ttt 288 

Val Leu Leu Glu Asn Pro Ser Gin Gly Asn Glu Trp Ser His He Phe 

85 * -90' ; ■ 95 

tgg gtg aac aga agt eta cct ttg tgg ggc tta cag gtt tea gtg gca 336 

Trp Val Asn Arg Ser Leu Pro Leu Trp Gly Leu Gin Val Ser Val Ala 

100 105 110 

i ; 

ttg ata agt ctg ttt gaa aca ata tta ctt ggt* tat ctt agt tat aag 384 

Leu He Ser Leu Phe Glu Thr lie Leu Leu Gly Tyr Leu Ser Tyr Lys 

115 * 120 . 125 

, gga aac ate tgg gaa cag att tta cga ata ccc ttc ate ttg gaa ata 432 

Gly Asn He Trp Glu Gin lie Leu Arg lie Pro Phe He Leu Glu lie 
130 135 140 

att aat- gca gtt ccc ttc att ate tea ata ttc tgg cct tec tta agg 480 

lie Asn Ala Val Pro Phe lie lie Ser lie Phe Trp Pro Ser Leu Arg 

145 • 150 155 ( 160 

aat eta ttt gtc cca gtc ttt ctg aac tgt tgg ctt gee aaa cat gee 528 

Asn Leu Phe Val Pro Val Phe Leu Asn Cys Trp Leu Ala Lys His Ala 

165 170 . 175 

ttg gaa aac atg att aat gat eta cac aga gee att cag cgt aca cag 576 

Leu Glu Asn Met lie Asn Asp Leu His Arg Ala lie Gin Arg Thr Gin 
180 185 190 

tct gca atg ttt aat caa gtt ttg att tta ata tct aca tta eta tgc 624 

Ser Ala Met Phe Asn Gin Val Leu lie Leu lie Ser Thr Leu Leu Cys 

195 200 205 

ctt ate ttc acc tgc att tgt ggg ate caa cat ctg gaa cga ata gga 672 

Leu lie Phe Thr Cys lie Cys Gly He Gin His Leu Glu Arg lie Gly 
210 215 220 

aag agg ctg aat etc ttt gac tec ctt tat ttc tgc att gtg acg ttt 720 

Lys Arg Leu Asn Leu Phe Asp Ser Leu Tyr Phe Cys lie Val Thr ^he 
225 230 235 240 

tct act gtg ggc ttc ggg gat gtc act cct gaa aca tgg tec tec aag 768 

Ser Thr Val Gly Phe Gly Asp Val Thr Pro Glu Thr Trp Ser Ser Lys 

245 250 255 

ctt ttt gta gtt get atg att tgt gtt get ctt gtg gtt eta ccc ata 816 

Leu Phe Val Val Ala Met He Cys Val Ala Leu Val Val Leu Pro lie 
260 265 270 

cag ttt gaa cag. ctg get tat ttg tgg atg gag aga caa aag tea gga 864 

Gin Phe Glu Gin Leu Ala Tyr Leu Trp Met Glu Arg Gin Lys Ser Gly 

275 280 285 
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i ... 

gga aac tat agt cga cat aga get caa act gaa aag cat gtc gtc ctg 912 

Gly Asn Tyr Ser Arg His Arg Ala Gin Thr Glu Lys His Val Val Leu 

290 2 95 \ 300 

tgt gtc age tea ctg aag att gat tta ctt atg ga,t ttt. tta aat gaa 960 

Cys Vai Ser Ser Leu Lys lie Asp Leu Leu Met Asp Phe Leu Asn Glu 

305 310 315 .' 320 . 

ttc tat get cat cct agg etc cag gat tat tat gtg gtg att ttg tgt 1008 

Phe Tyr Ala His Pro Arg Leu Gin Asp Tyr Tyr. Val Val lie Leu Cys 

325 330* ; 335 

»■ . •..*. 

cct act gaa atg gat gta cag gtt cga agg gta ctg cag att cca atg 1056 

Pro Thr Glu Met Asp Val Gin Val Arg Arg Val Leu Gin lie Pro Met 

340 , 345 350 

ftgg tec caa cga gtt ate tac ctt caa ggt tea' gee ctt aaa gat caa 1104 

Trp Ser Gin Arg Val lie Tyr 'Leu Gin Gly Ser Ala Leu Lys Asp Gin 
355 360 365 

gac eta ttg aga gca aag atg gat gac get gag gee tgt ttt att etc 1152 

Asp Leu Leu Arg Ala Lys Met Asp Asp Ala Glu Ala Cys Phe lie Leu 
370 " 375 * 380 

agt age cgt tgt gaa gtg gat agg aca tea tct gat cac caa aca att 1200 

Ser Ser Arg Cys Glu Val Asp Arg Thr Ser Ser Asp His Gin Thr lie 

385 390 395 , 4 00 

ttg aga gca tgg get gtg aaa gat ttt get cca aat tgt cct ttg tat 1248 

Leu Arg Ala Trp Ala Val Lys Asp Phe Ala Pro Asn Cys Pro Leu Tyr 

405 410 .415 

gtc cag ata tta aag cct gaa aat aaa ttt cac ate aaa ttt get gat 1296 

Val Gin lie Leu Lys Pro Glu Asn Lys Phe His lie Lys Phe Ala Asp 

420 425 430 

cat gtt gtt tgt gaa gaa gag ttt aaa tac gee atg tta get tta aac 134 4 

His Val Val Cys Glu Glu Glu Phe Lys Tyr Ala Met Leu Ala Leu Asn 
435 440 445 

tgt ata tgc cca gca aca tct aca ctt att aca eta ctg gtt cat acc 1392 . 

Cys lie Cys Pro Ala Thr Ser Thr Leu lie Thr Leu Leu Val His Thr 
450 455 460 

tct aga "ggg caa gaa ggc cag caa teg cca gaa caa tgg cag aag atg 1440 

Ser Arg Gly Gin Glu Gly Gin Gin Ser Pro Glu Gin Trp Gin Lys Met 

465 470 475 480 

tac ggt aga tgc tec ggg aat gaa gtc tac cac att gtt ttg gaa gaa 1488 

Tyr Gly Arg Cys Ser Gly Asn Glu Val Tyr His lie Val Leu Glu Glu 

485 490 495 

i 

agt aca ttt ttt get gaa tat gaa gga aag agt ttt aca tat gee tct 1536 

Ser Thr Phe Phe Ala Glu Tyr Glu Gly Lys Ser Phe Thr Tyr Ala Ser 

500 505 510 

ttc cat gca cac aaa aag ttt ggc gtc tgc ttg att ggt gtt agg agg 1584 

Phe His Ala. His Lys Lys Phe Gly Val Cys Leu lie Gly Val Arg Arg 
515 520 525 
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gag gat aab aaa aac att ttg ctg aat cca ggt cct cga tac att atg 1632, 
Glu Asp Asn Lys Asn lie Leu ,^eu Asn Pro Qly Pro Arg Tyr He Met , 

530 " 535 ' 540 / . 

aat tct acg gac ata tgc tti tat att aat att acc. aaa gaa gag 'aat 1680 
Asn Ser Thr iAsp He Cys Phe Tyr He Asn He Thr Lys Glu Glu Asn . 
545 550 .555 ' ■ 560 

tea gca ttt aaa aac caa gac cag cag aga aaa age aat gtg tec agg 1728 

Ser Ala Phe Lys Asn Gin Asp Glri Gin Arg Lys Ser ,Asn Val Ser Arg 

565 . 570 575 

t » 

teg ttt tat cat gga cct tec aga tta cct gta cat age ata att gee 177 6 1 
Ser Phe Tyr His Gly Pro Ser Arg Leu Pro Val His Ser lie lie Ala . 
( 580 ' 585 590 

age atg ggt act gtg get ata gac ctg caa gat aca age tgt aga tea 1824 
Ser Met Gly Thr Val Ala lie Asp Leu Gin Asp Thr Ser Cys Arg Ser 
595 * 600 605 . 

gca agt ggc' cct acc ctg tct ctt cct aca gag gga age aaa gaa ata 1872 
Ala Ser Gly Pro Thr Leu Ser Leu Pro Thr Glu Gly Ser Lys Glu lie 
610 ' 615 620 

aga aga cct age att get cct gtt tta gag' gtt gca gat aca tea . teg* 1920 
Arg Arg Pro Set He Ala Pro Val Leu Glu Val Ala Asp Thr Ser Ser 
625 " 630 , 635 .64 0 

att caa aca tgt gat ctt eta agt gac caa tea gaa gat gaa act aca 1968 
lie Gin Thr Cys Asp Leu Leu Ser Asp Gin Ser Glu Asp Glu Thr Thr 
645 650 655 

. . ■ i 

cca gat gaa gaa atg tct tea aac tta gag tat get aaa. ggt tac cca 2016 
Pro Asp Glu Glu Met Ser Ser Asn Leu Glu Tyr Ala Lys Gly Tyr Pro 
660 665 670 

cct tat tct cca tat ata gga agt tea ccc act ttt tgt cat etc ctt 2064 
Pro Tyr Ser Pro Tyr lie Gly Ser Ser Pro Thr Phe Cys His Leu Leu 
675 " . 680 '685 

cat gaa aaa gta cca ttt tgc tgc tta aga tta gac aag agt tgc caa 2112 
His Glu Lys Val Pro Phe Cys Cys Leu Arg Leu Asp Lys Ser Cys Gin 
690 695 700 

cat aac tac tat gag gat gca aaa gee tat gga ttc aaa aat aaa eta 2160 
His Asn Tyr Tyr Glu Asp Ala Lys Ala Tyr Gly Phe Lys Asn Lys Leu 
705 710 715 720 

att ata gtt gca get gaa aca get gga aat gga tta tat aac ttt att 2208 
lie lie Val Ala Ala Glu Thr Ala Gly Asn Gly Leu Tyr Ash Phe lie 
725 730 735 

gtt cct etc agg gca tat tat aga cca aag aaa gaa ctt aat ccc ata 2256 
Val Pro Leu Arg , Ala Tyr Tyr Arg Pro Lys Lys Glu Leu Asn Pro lie 
740 745 750 

gta ctg eta ttg gat aac ccc eta gat gac tta etc agg tgt gga gtg 2304 
Val Leu Leu Leu Asp Asn Pro Leu Asp Asp Leu Leu Arg Cys Gly Val 
,755 * 760 . 765 
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act ttt get- get aat atg gtg g.tt gtg gat aaa gag age ace atg agt 2352, 

Thr Phe Ala Ala Asn Met Val yal Val Asp I^ys Glu Ser Th* Met Ser , 
770 ' * 775 ' 780 

■ ■ " » « i i r 

gec gag. gaa gac tac atg gca gat gee aaa ace att.-gtg aac gtg bag 2400 

Ala Glu Glu Asp Tyr Met Ala Asp Ala Lys Thr lie Val Asn Val Gin 

785 790 .795 ' ■ 800 

aca etc ttc agg ttg ttt tec agt etc agt att ate aca gag eta act 2448 

Thr Leu Phe Arg Leu Phe Ser Ser Leu Ser lie lie ,Thr Glu Leu Thr 
805 810 815 

• i 1 1 i * » 

cac ccc gee aac atg aga ttc atg caa ttc aga gee aaa gac tgt tac 24 96 

His Pro Ala Asn Met Arg Phe Met Gin Phe Arg Ala Lys Asp Cys Tyr ... 

,820 , 825 83.0 

tct ctt get ctt tea aaa ctg gaa aag aaa gaa egg gag aga ggc tct 2544 

Ser Leu Ala Leu Ser Lys Leu Glu Lys Lys Glu Arg Glu Arg Gly Ser 
835 840 - 845 

aac ttg gee 'ttt atg ttt cga ctg cct ttt get get ggg agg gtg. ttt 2592 

Asn Leu Ala Phe Met Phe Arg Leu Pro. Phe Ala Ala Gly Arg Val Phe 

850 855 860 

i * ■ 

age ate agt atg ttg gac act ctg ctg tat' cag tea ttt gtg aag gat 1 2640 

Ser lie Ser Met Leu Asp Thr Leu Leu Tyr Gin Ser Phe Val Lys Asp 
865 870 , 875 880 

tat atg att tct ate acg aga ctt ctg ttg gga ctg gac act aca cca 2688 

Tyr Met lie Ser lie Thr Arg Leu Leu Leu Gly Leu Asp Thr Thr Pro 
885 890 895 

gga tct ggg ttt ctt cgt tct atg aaa ate act gca gat gac tta tgg 2736 

Gly Ser Gly Phe Leu Cys Ser Met Lys lie Thr Ala Asp Asp Leu Trp 

900 905 910 

ate aga act tat gee aga ctt tat cag aag ttg tgt tct tct act gga 2784 

lie Arg Thr Tyr Ala Arg Leu Tyr Gin Lys' Leu Cys Ser Ser Thr Gly 
915 920 92'5 

gat gtt ccc att gga ate tac agg act gag tct cag aaa ctt act aca 2832 

Asp Val Pro lie Gly lie Tyr Arg Thr Glu Ser Gin Lys Leu Thr Thr 
930 935 940 

tct gag tct cga aaa ata gca tea caa tct caa ata tct ate agt gta 2880 

Ser Glu Ser Arg Lys lie Ala Ser Gin Ser Gin lie Ser lie Ser Val 
945 950 955 960 

gaa gag tgg gaa gac acc aaa gac tec aaa gaa caa ggg cac cac cgc 2928 

Glu Glu Trp Glu Asp Thr Lys Asp Ser Lys Glu Gin Gly His His Arg 
965 970 975 

age aac cac cgc aac tea aca tec agt gac cag teg gac cat ccc ttg 297 6 

Ser Asn His Arg Asn Ser Thr Ser Ser Asp Gin Ser Asp His Pro Leu 

980 985 990 

ctg egg aga aaa age atg cag tgg gee cga aga ctg age aga aaa ggc 3024 

Leu Arg Arg Lys Ser Met Gin Trp Ala Arg Arg Leu Ser Arg Lys Gly 
995 1000 1005 
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cca aaa cac tct ggt aaa aca get gaa' aaa ata acc cag cag cga ctg 
Pro Lys His Ser Gly Lys Thr Ala Glu Lys He, Thr Gin Gin Arg Leu 
1010 1015 1020 



3072 



aac etc tac agg agg tea gaa aga caa gag .ctt get gaa ctt gtg aaa 3120 
Asn Leu 1 Tyr Arg Arg Ser Glu Arg Gin Glu Leu- Ala Glu Leu Val Lys 
1025 1030 1035 ' ' 1040 



aat aga atg aaa cac ttg ggt ctt tct aca gtg gga tat gat gaa atg 
Asn Arg Met Lys His Leu Gly Leu Ser Thr Val Gly Tyr Asp Glu Met 
■ 104 5. ' 1050 ; 1055 



3168 



aat gat cat caa agt acc etc tec tac ate ctg att aac cca tct cca 
Asn Asp His Gin Ser Thr Leu Ser Tyr He Leu He Asn Pro Ser Pro 
1060 , • 1065 1Q70 



3216. 



gat acc aga ata gag ctg aat gat gtt gta tac'tta att cga cca gat 
Asp Thr Arg He Glu Leu Asn Asp Val Val Tyr Leu He Arg Pro Asp 
1075 1080 1085 



3264 



cca ctg gec tac ctt cca aac agt gag ccc agt cga aga aac age ate 
Pro Leu Ala Tyr Leu Pro Asn Ser Glu Pro Ser Arg Arg Asn Ser lie 
1090 1095 1100 



3312 



tgc aat gtc act ggt caa gat tct egg gag gaa act caa ctt tga 
Cys Asn Val Thr Gly Gin Asp Ser Arg Glu Glu Thr Gin Leu * 
1105 1110 1115 , 



3357 



<210> 10 

<211> 2847 

<212> DNA 

<213> Homo sapiens 



<220> 
<221> CDS 
<222> (561) 

<4d0> 10 
ccacgcgtcc 
gggacagctc 
cagggegact 
tcacggccag 
cgagctccat 
tggcaaatcc 
ttctccccca 
caagtagttt 
ccagagaacc 
gtcatgtctg 



(2477) 



ggccctgtgc 
cagcagtggc 
tcaccttcaa 
ccgatctgcc 
ggtttgccca 
gcagatcaca 
ccccagcgcc 
ctegteggag 
tcgtccactc 
agtcacagag 



ttcggatggc ggcgggaggt tgatggcgag 
tgatttgggg gagaaacaaa atetgeagat 
gtggtgagct ctcctgacct gcggccagtc 
egctcccgga ggggteggge agtgccggct 
accctgcgcg atggtgactc. tgggcgcgga 
gaatgaaggc ggggagcgcg gccggcggcc 
cagggaagcg gctcaaccac ctgaatcegg 
aagggegget cacctgggcg ccaagactca 
ggaaaccaaa gcagaaccac ttttctctcg 
atg ggc aag ate gag aac aac gag 
Met Gly Lys lie Glu Asn Asn Glu 
1 ' .5 



tggtgctgaa 60 
ggaatccgag 120 
tccactccat 180 
ggacccgccc 240 
ggttggcgac 300 
ggeggggget 360 
aaaacgccaa 420 
gtcccgctgc 480 
gtctcgttaa 540 
agg gtg ate 593 
Arg Val He 
10 



etc aat gtc ggg ggc acc egg cac gaa acc tac cgc age acc etc aag 
Leu Asn Val Gly Gly Thr Arg His Glu Thr Tyr Arg Ser Thr Leu Lys 
15 .20 25 



641 



acc ctg cct gga aca cgc ctg gee. ctt ctt gec tec tec gag ccc cca 689 
Thr Leu Pro Gly Thr Arg Leu Ala Leu Leu Ala Ser Ser Glu Pro Pro 
30 35 40 
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ggc gac tge ttg acc acg gcg ggc gac aag ctg cag .ccg teg e'eg cct 737 , 

Gly Asp Cys Leu Thr Thr Ala ,,Gly Asp Lys ^eu Gin Pro Ser Pro Pro . 

45 M 50 55 . . 

cca ctg teg ccg ccg ccg aga gcg ccc ccg ctg tec ccc ggg cca 'ggc 785 

Pro Leu SenPro Pro Pro Arg Ala Pro feo Leu Ser Pro Gly Pro Gly . 
60 65 ; 70 \' 75 

ggc tgc ttc gag ggc ggc gcg ggc aac tgc agt tec cgc ggc ggc agg 833 

Gly Cys Phe Glu Gly Gly Ala Gly Asn Cys Ser Ser ,Arg Gly Gly Arg 

80 85 90 

l ' ' ' 

gee age gac cat ccc ggt ggc ggc cgc gag ttc ttc ttc gac egg cac 881 

Ala Ser Asp His »Pro Gly Gly Gly Arg Glu Phe Phe Phe Asp Arg His 

,95 , 100 105 

i ' ■ . 

ccg ggc gtc ttc gee tat gtg etc aat tac tac cgc acc ggc aag ctg 929 

Pro Gly Val Phe Ala Tyr Val Leu Asn Tyr Tyr Arg Thr Gly Lys Leu 

110 " 115 120 

cac tgc ccc* gca gac gtg tgc ggg ccg qtc ttc gag gacj gag ctg gec 977 

His Cys Pro Ala Asp Val Cys Gly Pro Leu Phe Glu Glu Glu Leu Ala 

125 130 135 

i * . 

ttc tgg ggc ate gac gag acc gac gtg ga^ ccc tgc' tgc tgg atg acc 1025 

Phe Trp Gly lie Asp.'Glu Thr Asp Val Glu Pro Cys Cys Trp Met Thr 

140 145 , 150 155 

tac egg cag cac cgc gac gee gag gag gcg ctg gac ate ttc gag acc 1073 

Tyr Arg Gin His Arg Asp Ala Glu Glu Ala Leu Asp lie Phe Glu Thr 

160 165 170 

ccc gac etc att ggc ggc gac ccc ggc gac gac gag gac ctg gcg gee 1121 

Pro Asp Leu lie Gly Gly Asp Pro Gly Asp Asp Glu Asp Leu Ala Ala 

175 " 180 185 

aag agg. ctg ggc ate gag gac gcg gcg ggg etc ggg ggc ccc gac ggc 1169 

Lys Arg Leu Gly lie Glu Asp Ala Ala Gly Leu Gly ( Gly Pro Asp Gly 
190 195 200 

aaa tct ggc egc tgg agg agg ctg cag ccc cgc atg tgg gee etc ttc 1217 

Lys Ser Gly Arg Trp Arg Arg Leu Gin Pro Arg Met Trp Ala Leu Phe 

205 " 210 215 

gaa gac ccc tac teg tec aga gee gee acjg ttt att get ttt get tct 1265 

Glu Asp Pro Tyr Ser Ser Arg Ala Ala Arg Phe lie Ala Phe Ala Ser 
220 225 230 235 

tta ttc ttc ate ctg gtt tea att aca act ttt tgc ctg gaa aca cat 1313 

Leu Phe Phe He Leu Val Ser lie Thr' Thr Phe Cys Leu Glu Thr His 

240 245 250 

gaa get ttc aat att gtt aaa aac aag aca gaa cca gtc ate aat ggc 1361 

Glu Ala Phe Asn He Val Lys Asn Lys Thr Glu Pro Val He Asn Gly 

255 260 265 

aca agt gtt gtt eta cag tat gaa att gaa acg gat cct gee ttg acg 1409 

Thr Ser Val Val Leu Gin Tyr Glu He Glu Thr Asp Pro Ala Leu Thr 
270 275 280 
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tat gta gaa gga gtg tgt gtg gtg tgg ttt act ttt gaa ttt tta gtc 1457 

Tyr Val Glu Gly Val Cys Val Val Trp Phe Thr ..Phe Glu Phe Leu Val 
285 290 295 

cgt att gtt ttt tea ccc aat aaa ctt gaa ttc atc : aaa aat etc ttg 1505 

Arg lie* Val Phe Ser Pro Asn Lys Leu Glu Phe -lie Lys Asn Leu Leu 

300 . 305 310 ' ' 315 

aat ate att gad ttt gtg gee ate eta cct ttc tac tta. gag gtg gga 1553 

Asn lie He Asp Phe Val Ala He Leu Pro Phe Tyr Leu Glu Val Gly 

320 . ' 325 ' ; 330 

etc agt ggg ctg tea tec aaa get get aaa gat gtg ctt ggc ttc etc 1601 

Leu Ser Gly Leu Ser Ser Lys Ala Ala Lys Asp Val Leu Gly Phe Leu 

335 , 34 0 34,5 



agg gtg gta agg ttt gtg agg ate ctg aga att ttc aag etc ace cgc 
Arg Val Val Arg Phe Val Arg lie Leu Arg He Phe Lys Leu Thr Arg 
350 * 355 360 



1649 



•cat ttt gta ggt ctg agg gtg ctt gga cat act ctt cga get agt act 1697 
His Phe Val Gly Leu Arg Val Leu Gly His Thr Leu Arg Ala Ser Thr 
365 370 375 

aat gaa ttt ttg ctg ctg ata att ttc ctg get eta gga gtt ttg ata 1745 
Asn Glu Phe Leu Leu Leu He He Phe Leu Ala Leu Gly Val Leu He 
380 385 390 , 395 

ttt get acc atg ate tac tat gee gag aga gtg gga get caa cct aae 1793 
Phe Ala Thr Met He Tyr Tyr Ala Glu Arg Val Gly Ala Gin Pro Asn 
400 405 410 

gac cct tea get agt gag cac aca cag ttc aaa aac att ccc att ggg 1841 
Asp Pro Ser Ala Ser Glu His Thr Gin Phe Lys Asn He Pro He Gly 
415 420 425 

ttc tgg tgg get gta gtg acc atg act acc ctg ggt tat ggg gat atg 1889 
Phe Trp Trp Ala Val Val Thr Met Thr Thr Leu Gly Tyr Gly Asp Met 
430 435 440 

tac ccc caa aca tgg tea ggc atg ctg gtg gga gee ctg tgt get ctg 1937 
Tyr Pro Gin Thr Trp Ser Gly Met Leu Val Gly Ala Leu Gys Ala Leu 
445 450 455 

get gga gtg ctg aca ata gee atg cca gtg cct gtc att gtc aat aat 1985 
Ala Gly Val Leu Thr lie Ala Met Pro Val Pro Val lie Val Asn Asn 
460 465 470 475 

ttt gga atg tac tac tec ttg gca atg gca aag cag aaa ctt cca agg 2033 
P'he Gly Met Tyr Tyr Ser Leu Ala Met Ala Lys Gin Lys Leu Pro Arg 
480 485 490 

aaa aga aag aag cac ate cct cct get cct cag gca age tea cct act . 2081 
Lys Arg Lys Lys His He Pro Pro Ala Pro Gin Ala Ser Ser Pro Thr 
4 95 500 505 

ttt tgc aag aca gaa tta aat atg gee tgc aat agt aca cag agt gac 2129 
Phe Cys Lys Thr Glu Leu Asn Met Ala Cys Asn Ser Thr Gin Ser Asp 
510 515 520 
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r 

aca tgt ctg ggc aaa gac aat cga ctt ctg gaa cat aac aga . tea gtg 

Thr Cys Leu Gly Lys Asp Asn Arg Leu Leu Glu, His Asn Arg Ser Val 
525 530 535 



2177 



tta tea ggt gac gac agt aca gga agt gag ccg cca eta tea ccc cca 
Leu Ser' Gly Asp Asp Ser Thr Gly Ser Glu Pro. Pro Leu Ser Pro Pro 
540 545 " 550 * ' 555 



2225 



gaa agg etc ccc ate aga cgc tct agt acc aga gac aaa aac aga aga 
Glu Arg Leu Pro lie Arg Arg Ser Ser Thr Arg Asp Lys Asn Arg Arg 
560 ' 565' ; 570 



2273 



ggg gaa aca tgt ttc eta. ctg acg aca ggt gat tac acg tgt get tct 
Gly Glu Thr Cys Phe Leu Leu Thr Thr Gly Asp Tyr Thr Cys Ala Ser 
575 580 585 



2321 



gat gga ggg ate agg aaa gga tat gaa aaa tec' cga age tta aac aac 2369 
Asjb Gly Gly lie Arg Lys Gly tyr Glu Lys Ser Arg Ser Leu Asn Asn 
590 * 595 600 



ata gcg ggc ttg gca ggc aat get ctg agg etc tct cca gta aca tea 
He Ala Gly Leu Ala Gly Asn Ala Leu Arg Leu Ser Pro Val Thr Ser 
605 * 610 . 615 



2417 



ccc tac aac tct cct tgt cct ctg. agg cgc tct cga tct ccc ate cca 
Pro Tyr Asn Ser Pro Cys Pro Leu Arg Arg Ser Arg Ser Pro He Pro 
620 625 630 , 635 



2465 



tct ate ttg taa accaaacaac caaactgeat cagteggcta aattgtatta 
Ser He Leu * 



2517 



attcaagygc 
ttaatacagt 
gctaatgaat 
taattgeect 
actaatatat 
accacttcat 



.tgtttacccc 
ataaatcttg 
cttacccagg 
gatatccttt 
aaacataccc. 
tggaacttta 



ataatggaaa taattaaatg 
cgtgatacta caatttgaag 
ctttaaagat tgtctaaagt 
tgcaataaaa tgacagatag 
tcagggagat atatttaaaa 
tttcttgtga 



tagagttact ccaggctcca 2577 
teagaaatge cacttgggta 2637 
agtgctaaga tccctcctat 2697 
tgtcagatat tgaccagtgc 2757 
cagtgtgctt ccaaatgcca 2817 

2847 



<2l0> 11 

<211> 638 

<212> PRT 

<213> Homo sapiens 



<400> 11 



Met 


Gly 'Lys 


lie 


Glu 


Asn 


Asn 


Glu 


Arg 


Val 


lie 


Leu 


Asn 


Val 


Gly 


Gly 


1 








5 










10 










15 


• 


Thr 


Arg 


His 


Glu 


Thr 


Tyr Arg 


Ser 


Thr 


Leu 


Lys 


Thr 


Leu 


Pro 


Gly 


Thr 






20 










25 










30 






Arg 


Leu 


Ala 
35 


Leu 


Leu 


Ala 


Ser 


Ser 
40 


Glu 


Pro 


Pro 


Gly 


Asp 
45 


Cys 


Leu 


Thr 


Thr 


Ala 
50 


Gly 


Asp 


Lys 


Leu 


Gin 
55 


Pro 


Ser 


Pro 


Pro 


Pro 
60 


Leu 


Ser 


Pro 


Pro 


Pro 


Arg 


Ala 


Pro 


Pro 


Leu 


Ser 


Pro 


Gly 


Pro 


Gly 


Gly Cys 


Phe 


Glu 


Gly 


65 










70 










75 










80 


Gly 


Ala 


Gly 


Asn 


Cys 
85 


Ser 


Ser 


Arg 


Gly 


Gly 
90 


Arg 


Ala 


Ser 


Asp 


His 
95 


Pro 


Gly 


Gly 


Gly 


Arg 


Glu 


Phe 


Phe 


Phe 


Asp 


Arg 


His 


Pro 


Gly 


Val 


Phe 


Ala 






100 










105 










110 






Tyr 


Val 


Leu 
115 


Asn 


Tyr 


Tyr 


Arg 


Thr 
120 


Gly 


Lys 


Leu 


His 


Cys 
125 


Pro 


Ala 


Asp 
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vai 

OJ. li 


Thr 


oiy 
Asp 


tit) 

Val 
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Glu ' 
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Glu Glu 
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blU 
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AT a 
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(11 \r 
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Tip 
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150" 










rb-b 










1 DU 
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Glu 


Glu 
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Leu Asp 


He 


Phe. 
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Pro Asp 


Leu 


He 


vj J. _y 






i 




165 










•170 










JL / O 




ft! V 


Asp 


Pro Gly Asp Asp Glu 


Asp 
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Leu 




Tie* 
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d. L> -J 
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Lie U 


Oil 1 


P r* 0 
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1 


£. U 










Del 


t\i~ y 
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7\ 1 
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O T C 
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He 


V cM X . 
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£ y u 
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He 


Lys 
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Leu 


Leu 
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T 1 0 
11c 
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fne 


JUD 
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t 
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"3 "5 Ci 
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vai 


Arg 


lie 


Leu 


Arg 


lie 


Phe 


Lys 


Leu 


Thr 


Arg 


His 


Phe 


vax 


bxy 


Leu 






o c c 

355 










360 










365 








Arg 


Vdl 


Leu 


Gly 


His 


Thr 


Leu 


Arg 


Ala 


Ser 


Thr 


Asn 


Glu 


nu 0 

fne 


Leu 


Leu 




J / u 










37 5 






Leu 




380 










Leu 


x xe 


lie 


Phe 


Leu 


Ala 


Leu 


Gly 


Val 


He 


Phe ' Ala 


1 nr 


Met f- 


Tin 

lie 












390 










"3 Q c: 

3y 0 






• 




/i nn 


Tyr 


Tyr 


Ala 


Glu 


Arg 


Val 


Gly 


Ala 


Gin 


Pro 


Asn 


Asp 


Pro 


Ser 


7\ 1 -> 
M.1 a 


oer 




















410 










H X O 




Pin 
olU 


nis 


Tnr 


Gin 


Pne 


Lys 


Asn 


He 


Pro 


He 


bxy 


Phe 


Trp 


Trp 


nla 


vai 








yi 0 n 










425 










a 

m JU 








i nr 


Met 


rnr 


rp v» -v- 

1 nr 


Leu 


Gly 


Tyr 


Gly Asp 


Met 


Tyr' 


Pro 


bin 


1 nr 


Trp 






A Ci 










440 










445 








O n V 


\jJLy 


Met 


Leu 


vax 


Gly Ala 


Leu 


Cys 


Ala 


Leu 


Ala 


Gly 


V cl JL 


Leu 


Thr 

1 nr 














455 










460 










1JL6 


rtxa 


l v JcU 


Pro 


vax 


Pro 


Val 


He 


Val 


Asn 


Asn 


Phe 


Gly 


Met 


Tyr 


Tyr 


A £^ 










470 










4 7S 










/ion 
M 0 u 




Leu 


m.x a 


iyje i_ 


A 1 0 
Hid 


Lys 


Gin 


Lys 


Leu 


Pro 


Arg 


Lys 


Arg 


Lys 


Lys 


MIS 










4 85 










490 










H yO 




Tip 

1 J. t; 


D ~r~ r\ 




nla 


Pro 


Gin 


Ala 


Ser 


Ser 


Pro 


1 nr 


Phe 


Cys 


Lys 


1 III 










J U u 










505 










cin 

JiU 






Leu 


Asn 


Met. 


ax a 


Cys 


Asn 


Ser 


Thr 


Gin 


Ser 


Asp 


Thr 


Cys 


Leu 


oxy 


Lys 






515 










520 










525 






Don 


Asn 


Arg 


Leu 


Leu 


Glu 


His 


Asn 


-Arg 


Ser 


Val 


Leu 


Ser 


Glv 

0 jl jr 


Den 
no fcs 


Asp 




O jU 










535 










540 










Ser 


Thr 


Gly 


Ser 


Glu 


Pro 


Pro 


Leu 


Ser 


Pro 


Pro 


Glu 


Arg 


Leu 


Pro 


Tie 

1 J- <3 


OHO 










550 










555 










D DU 




ril y 


Ser 


Ser 


Thr 


Arg 


Asp 


Lys 


Asn 


Arg 


Arg 


Gly Glu 


Thr 

J. 1 1 JL 


^y& 


Php 
r lie 










565 










570 










D / D 




Leu 


Leu 


Thr 


Thr 


Gly 


Asp 


Tyr 


Thr 


Cys 


Ala 


Ser 


Asp 


Gly 


Gly 


He 


Arg 








580 










.585 










590 






Lys 


Gly 


Tyr 


Glu 


Lys 


Ser 


Arg 


Ser 


Leu 


Asn 


Asn 


He 


Ala 


Gly 


Leu 


Ala 






595 










600 










605 









-31 - 



BNSDOCID: <WO_02055701A2_I_> 



WO 02/055701 



PCT/US01/49060 



Gly Asn Ala Leu Arg Leu Ser. Pro Val Thr Ser Pro Tyr Asn Ser Pro 

610 615 „ , 620 

Cys Pro Leu Arg Arg 'Ser Arg Ser Pro lie ^ro Ser lie Leu 
-623- : 630- -63-5- 

■: ' ' V ' ... : .. ■ • r 

» ... 

<210> 12 ... 

<211> 1917 

<212> DNA 

<213> Homo sapiens 

<220> ' , ' . 

<221> CDS 

<222> (1) . . . (19L7) 

<400> 12 ' . , 

atg ggc aag ate gag aac aac gag agg gtg ate etc aat gtc ggg ggc 4 8 

Met Gly Lys lie Glu Asn Asn Glu Arg Val lie Leu Asn Val Gly Gly 

1 * 5 . 10 . .15 

ace egg cac gaa acc tac cgc age ace etc aag acc ctg cct gga aca 96 

Thr Arg His Glu Thr Tyr Arg Ser Thr Leu Lys Thr Leu Pro Gly Thr 

20 . 25 30 

I * 

cgc ctg gec ctt ctt gec tec tec gag ccc cca ggc gac tgc ttg acct 144 
Arg Leu Ala Ldu Leu .Ala Ser Ser Glu Pro Pro Gly Asp Cys Leu Thr 
35 40 , 45 ' 

acg gcg ggc gac aag ctg cag ccg teg ccg cct cca ctg teg ccg ccg 192 
Thr Ala Gly Asp Lys Leu Gin Pro Ser Pro Pro Pro Leu Ser Pro Pro 
50 55 60 

ccg aga gcg ccc ccg ctg tec ccc ggg cca ggc ggc'tgc ttc gag ggc ?.40 
Pro Arg Ala Pro Pro Leu Ser Pro Gly Pro Gly. Gly Cys Phe Glu Gly 
65 70 75 " 80 

ggc gcg ggc aac tgc agt tec cgc ggc ggc agg gee age gac cat ccc 288 
Gly Ala Gly Asn Cys Ser Ser Arg Gly Gly Arg Ala Ser Asp His Pro 
85 90 ' ■■• 95 

ggt ggc ggc cgc gag ttc ttc ttc gac egg cac ccg ggc gtc ttc gec 336 
Gly Gly Gly Arg Glu Phe Phe Phe Asp Arg His Pro Gly Val Phe Ala 
100 105 110 

tat gtg etc aat tac tac cgc acc ggc aag ctg cac tgc ccc gca gac 38.4 
Tyr Val Leu Asn Tyr Tyr Arg Thr Gly Lys Leu His Cys Pro Ala Asp 
115 120 125 

gtg tgc ggg ccg etc ttc gag gag gag ctg , gec ttc tgg ggc ate gac 4 32 
Val Cys Gly Pro Leu Phe Glu Glu Glu Leu Ala Phe Trp Gly lie Asp 
130 135 140 

gag acc gac gtg gag ccc tgc tgc tgg atg acc tac egg cag cac cgc 4 80 
Glu Thr Asp Val Glu Pro Cys Cys Trp Met Thr Tyr Arg Gin His Arg 
145 150 155 160 

gac gee gag gag gcg ctg gac ate ttc gag acc ccc gac etc att ggc 528 
Asp Ala Glu Glu Ala Leu Asp lie Phe Glu Thr Pro Asp Leu lie Gly 
165 170 175 



-32 



BNSDOCID: <WO_02055701 A2_l_> 



WO 02/055701 PCTAJS01/49060 

ggc gac ccc ggc gac gac gag gac ctg gcg gcc aag agg ctg. ggc ate 57 6 
Gly Asp Pro Gly Asp Asp Glu Asp Leu Ala Ala, Lys Arg Leu Gly lie 
180 185 190 

gag gac gcg gcg ggg etc ggg ggc ccc gac ggc' aaa tct' ggc cgc tgg 624 
Glu Asp' Ala Ala Gly Leu Gly Gly Pro Asp Gly. Lys Ser Gly Arg Trp 
195 200 ' 205 • 

agg agg ctg cag ccc cgc atg tgg gcc etc ttc gaa gac ccc tac teg 672 
Arg Arg Leu Gin Pro Arg Met Trp Ala Leu Phe Glii Asp Pro Tyr Ser 

210 .215 * * 220 ; 

• » » ' ■ 

tec aga gcc gcc agg ttt att get ttt get tct tta ttc ttc ate ctg 720 
Ser Arg Ala Ala Arg Phe lie Ala Phe Ala Ser Leu Phe Phe lie Leu 
225 230 ( 235 ■ 240 

gtt tea att aca act ttt tgc ctg gaa aca cat' gaa get ttc aat att .7 68 
Val Ser lie Thr Thr Phe Cys Leu Glu Thr His Glu Ala Phe Asn lie 
*245 250 • 255 

i gtt aaa aac aag aca gaa cca gtc ate aat ggc aca agt gtt gtt eta 816 
Val Lys Asn Lys Thr Glu Pro Val lie Asn Gly Thr Ser Val Val Leu 
260 265 270 

cag tat gaa att gaa acg gat cct gcc ttg acg tat gta gaa gga gtg 8 64 
Gin Tyr Glu lie Glu Thr Asp Pro Ala Leu Thr Tyr Val Glu Gly Val 
275 280 ^ 2jB5 

tgt gtg gtg tgg ttt act ttt gaa ttt tta gtc cgt att gtt ttt tea 912 
Cys Val Val Trp Phe Thr Phe Glu Phe Leu Val Arg lie Val Phe Ser 
290 295 300 

ccc aat aaa ctt gaa ttc ate aaa aat etc ttg aat ate att gac ttt 960 
Pro Asn Lys Leu Glu Phe lie Lys Asn Leu Leu Asn lie lie Asp Phe 
305 310 315 320 

gtg gcc ate eta cct ttc tac tta gag gtg gga etc agt ggg ctg. tea 1008 
Val Ala lie Leu Pro Phe Tyr Leu Glu Val Gly Leu Ser Gly Leu Ser 
325 330 335 

tec aaa get get aaa gat. gtg ctt gge ttc etc agg gtg gta agg ttt 1056 
Ser Lys Ala Ala Lys Asp Val Leu Gly Phe Leu Arg Val Val Arg Phe 
340 345 350 

gtg agg ate ctg aga att ttc aag etc ace cgc cat ttt gta ggt ctg 1104 
Val Arg lie Leu Arg lie Phe Lys Leu Thr Arg His Phe Val Gly Leu 
355 360 365 

agg gtg ctt gga cat act ctt cga get agt act aat gaa ttt ttg ctg 1152 
Arg Val Leu Gly His Thr Leu Arg Ala Ser Thr Asn Glu Phe Leu Leu 
370 ^ 375 380 

ctg ata att ttc ctg get eta gga gtt ttg ata ttt get acc atg ate 1200 
Leu lie He Phe Leu Ala Leu Gly Val Leu He Phe Ala Thr Met He 
385 390 395 400 

tac tat gcc gag aga gtg gga get caa cct aac gac cct tea get agt 124 8 
Tyr Tyr Ala Glu Arg Val Gly Ala Gin Pro Asn Asp Pro Ser Ala Ser 
405 410 415 
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gag cac aca cag ttc aaa aac att ccc att ggg ttc tgg tgg get gta . 1296 
Glu His Thr Gin Phe Lys Asn lie Pro lie Gly Phe Trp Trp Ala Val 
420 . 425 430 

gtg acc atg act acc ctg ggt tat ggg gat atg tac ccc caa aca tgg 134 4 , . 
Val Thr Met Thr Thr Leu Gly Tyr Gly Asp Met Tyr Pro Gin. Thr Trp 
435 440 / 445 

tea ggc atg ct<g gtg gga gec ctg tgt get ctg get gga gtg ctg aca 1392 
Ser Gly Met Leu Val Gly Ala Leu Cys Ala Leu . Ala Gly Val Leu Thr 

450 455 460 : 

» .■ 

. • • • i ■ 

at a gee atg cca gtg cct gtc att gtc aat aat ttt gga atg tac tac .1440 

lie Ala Met Pro Val Pro Val lie Val Asn Asn Phe Gly Met Tyr Tyr 

465 470 , 475 480 

•tec ttg gca atg gca aag cag aaa ctt cca agg' aaa aga aag aag cac 14 88 
Ser Leu Ala Met Ala Lys Gin Lys Leu Pro Arg Lys Arg Lys Lys His 
* 485 490 495 

ate cct cct get cct cag gca ag.c tea cct act ttt tgc aag aca gaa 1536 

lie Pro Pro Ala Pro Gin Ala Ser Ser Pro Thr Phe Cys Lys Thr Glu 

500 505 510 

* • 

tta aat atg gec tgc aat agt aca cag agt gac aca tgt ctg ggc aaa 1584 

Leu Asn Met Ala Cys Asn Ser Thr Gin Ser Asp Thr Cys Leu Gly Lys 

515 520 525 

gac aat cga ctt ctg gaa cat aac aga tea gtg tta tea ggt gac gac 1632 
Asp Asn Arg Leu Leu Glu His Asn Arg Ser Val Leu Ser Gly Asp Asp 
530 535 540 

agt aca gga agt gag peg cca eta tea ccc cca gaa agg cte ccc ate 1680 
Ser Thr Gly Ser Glu Pro Pro Leu Ser Pro Pro Glu Arg Leu Pro He 
545 550 555 560 

aga cgc tct agt acc aga gac aaa aac aga aga ggg gaa aca tgt ttc 1728 
Arg Arg Ser Ser Thr Arg Asp Lys Asn Arg Arg Gly Glu Thr Cys Phe • 
565 570 575 

eta ctg acg aca ggt gat tac acg tgt. get tct g^t gga ggg ate agg 1776 
Leu Leu Thr Thr Gly Asp Tyr Thr Cys Ala Ser Asp Gly Gly He Arg 
580 585 590 

aaa gga "tat gaa aaa tec cga age tta aac aac ata gcg ggc ttg gca 1824 
Lys Gly Tyr Glu Lys Ser Arg Ser Leu Asn Asn He Ala Gly Leu Ala 
595 600 605 

ggc aat get ctg agg etc tct cca gta aca tea ccc tac aac tct cct 1872 
Gly Asn Ala Leu Arg Leu Ser Pro Val Thr Ser Pro Tyr Asn Ser Pro 
610 615 620 

tgt cct ctg agg cgc tct cga tct ccc ate cca tct ate ttg taa 1917 
Cys Pro Leu Arg Arg Ser Arg Ser Pro He Pro Ser He Leu * 
625 ~ ~ 630 635 



<210> 13 
<211> 6582 
<212> DNA 

<213> Homo sapiens 
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<220> 
<221> CDS 
<222> (524 )T 



<400> 13 

ccacgcgtcc 

cgagagaaag 

gagggggaac 

gcaccccgcc 

tccaccacgc 

gtacccaggc 

gccgaggagg 

ggcgagggag 

agcccggtag 



gcccacgcgt 
actgagagga 
cctcgtcggt 
ccgcccctcc 
ccggaagagg 
tccctggccc 
ggctgcgagc 
gctgtggctc 
agcccagctc 



ccgcccacgc gtccgagaag gcttaggtgg gcaggcagga 60 
gggaaagccg cgtaggtggg agtacagcgg cgcgagggtc 120 
gcagatgagg agggtgggct ttcagaacta gtcccccctc 180 
cgcgctgggg tcttcacggt gccctgcctc agagcccggc 240 
gagtctggcc gtcggctggc tcagggcggg ccggttggcf 300 
gagtc^cggga ccagagcgcg ' gggcggcgcg gqagccgcgg 360* 
gaaacggcgc ggcgcggcac ggcggacgag ttagggccgg 420 
ccgacagaga caggggagta gtgtcgggct gaggcgagac 480 
agcgcccggc agccttcgac gcg atg ttc cgc egg 535 
, Met Phe Arg Arg 

1 



age 
Ser 
5 


ttg aat cgt 
Leu Asn Arg 


"ttt 
Phe 


tgt get 
Cys Ala 
10 


Gly 


gaa 
Glu 


aaa 
Glu 


aaa 
Lys 
15 


cga 
Arg 


att 

y u u 

Val 


yy w 
Gly 
I 


aca 
Thr 


cgc 
Arg 
20 


583 


aca 
Thr 


gtg 
Val 


ttt 
Phe 


gtt 
Val 

t 


Gly 
25 


aat 
Asn 


cat 
His 1 


cca 
Pro 


Val 


• 

teg 
Ser 
30' 


gaa 
Glu 


aca 
Thr 


y dd 

Glu 


you 

Ala 

» 


Tyr 
35 


a L l 

He 

i 


O J JL 


gca 
Ala 


caa aga 
Gin Arg 


ttt 
Phe 
40 


tat 
Cys 


gat aat 
Asp Asn 


aga 
Arg 


a t^a 
He 
45 


gtc 
Val 


tea 
Ser 


tet 
Ser 


aag 
Lys 


t:at 
Tyr 
50 


Thr 


Leu 




taa 
Trp 


aat 
Asn 


ttt 
Phe 
55 


etc 
Leu 


cca 
Pro 


aag 
Lys 


aat 
Asn 


ct g 
Leu 
60 


ttt 
Phe 


gaa 

Glu 
i 


cag 
Gin 


ttt 
Phe 


aga 
Arg 
' 65 


aga 
Arg 


att 
He 


gca 
AJ.a 


121 


aat 
Asn 


ttt 
Phe 
70 


tat 
Tyr 


ttt 
Phe 


etc 
Leu 


ata 
lie 


ate 
lie 
75 


ttc 
Phe 


ctt 
Leu 


gta 

Val 


cag 
Gin 


gtc 
Val 
80 


aca 
Thr 


ata 
Val 


aac 
Asp 


aca 
Thr 


775 


cca 
Pro 
85 


act 
Thr 


age 
Ser 


cca 
Pro 


gtt 
Val 


acc 
Thr 
90 


agt 
Ser 


gga 
Gly 


ctt 

'Leu 


cea 
Pro 


ctt 
Leu 
95 


ttc 
Phe 


'ttt 
Phe 


gtt 
Val 


ata 
He 


act 
Thr 
100 


823 


gtt 
Val 


aca 
Thr 


gee 
Ala 


ate 
lie 


aag 
Lys 
105 


cag gga 
Gin Gly 


tat 
Tyr 


gag 
Glu 


gat tgg 
Asp Trp 
110 


ctg 
Leu 


aga 
Arg 


cac 
His 


aga 
Arg 
115 


get 

Ala 


871 


gac 
Asp 


aat 
Asn 


gaa 
Glu 


gtc 
Val 
120 


aac 
Asn 


aaa 
Lys 


age 
Ser 


act 
Thr 


gtt 
Val 
125 


tac att 
Tyr lie 


att 
He 


gaa 
Glu 


aat 
Asn 
130 


gca 
Ala 


aag 
Lys 


919 


cga 
Arg 


gtg aga aaa 
Val Arg Lys 
135 


gaa 
Glu 


agt 
Ser 


gaa 
Glu 


aaa 
Lys 
140 


ate 
He 


aag 
Lys 


gtt 
Val 


ggt gat 
Gly Asp 
145 


gta 
Val 


gta 
Val 


gaa 
Glu 


967 


gta 
Val 


cag 
Gin 
150 


gca 
Ala 


gat 
Asp 


gaa 
Glu 


acc 
Thr 


ttt 
Phe 
155 


ccc 
Pro 


tgt 
Cys 


gat 
Asp 


ctt 
Leu 


att 
He 
160 


ctt 
Leu 


eta 
Leu 


tea 
Ser 


tct 
Ser 


1015 


tgc 
Cys 
165 


ace act gat 
Thr Thr Asp 


gga ace tgt 
Gly Thr Cys 
170 


tat 
Tyr 


gtc 

Val 


act 
Thr 


aca 
Thr 
175 


gee 
Ala 


agt 
Ser 


ctt 
Leu 


gat 
Asp 


ggg 

Gly 
180 


1063 
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gaa tec aat tgc aag aca cat tat gca gta cgt ' gat acc att gca ctg 1111 
Glu Ser Asn Cys Lys Thr His Tyr Ala Val Arg Asp Thr lie Ala Leu 

*" " T85 1 : 190 " ; ~ T?5" ~ . 

i • • 

tgt aca gca gaa tec ate gat acc etc cga gca gca att gaa .tgt gaa 1159 

Cys Thr Ala Glu Ser lie Asp Thr Leu Arg Ala AJa lie Glu Cys Glu 

200 205 210 

cag cct caa cct gac etc tac aaa ttt; gtt ggg cga ate aat ate tac . 1207 
Gin Pro Gin Pro Asp Leu. Tyr Lys Phe Val' Gly Arg lie Asn lie Tyr 
. 215 220 , 225 

agt aat agt ctt gag get gtt gee agg tct ttg gga cct gaa aat etc 1255 

Ser Asn Ser Leu Glu Ala Va»L Ala Arg Ser Leu Gly Pro Glu Asn Leu 

230 235 '240 

i 

ttg ctg aaa gga_ get acg eta aaa aat acc gag aag ata tat gga gtt 1303 

Leu Leu Lys Gly Ala Thr Leu Lys Asn Thr Glu Lys lie Tyr Gly Val 

245 250 255 .260 

get gtt tac act gga" atg gaa acc aaa atg get ttg aac tac caa ggg 1351 
Ala Val Tyr Thr Gly Met Glu Thr Lys Met Ala Leu Asn Tyr Gin Gly 
265 270 ■ ' 275 

aaa tct cag aaa cgt tct get gtt gaa aaa tct ; att aat get ttc ctg 1399 
Lys Ser Gin Lys Arg Ser Ala Val Glu Lys Ser lie Asn Ala Phe Leu . 
280 285 290 

att gta tat tta ttt ate tta ctg acc aaa get gca gta tgc act act 1447 
lie Val Tyr Leu Phe He Leu Leu Thr Lys Ala Ala Val Cys Thr Thr 
295 300 305 

eta aag tat gtt tgg caa agt acc cca tac aat gat gaa cct tgg tat 14 95 
Leu Lys Tyr Val Trp Gin Ser Thr Pro Tyr Asn Asp Glu Pro Trp Tyr 
310 315 320 

aac caa aag act cag aaa gag cga gag acc ttg aag gtt tta aaa atg 1543 
Asn Gin Lys Thr Gin Lys Glu Arg Glu Thr Leu Lys Val Leu Lys Met 
325 330 335 340 

,ttc acc gac ttc eta tea ttt atg gtt eta ttc aac ttt ate att cct 1591 
Phe Thr Asp Phe Leu Ser Phe Met Val Leu Phe Ash Phe He lie Pro 
345 350 355 

gtc tec atg tac gtc aca gta gaa atg cag aaa ttc ttg ggc tec ttc 1639 
Val Ser Met Tyr Val Thr Val Glu Met Gin Lys Phe Leu Gly Ser Phe 
360 365 370 

ttc ate tea tgg gat aag gac ttt tat gat gaa gaa att aat gaa gga 1687 
Phe He Ser Trp Asp Lys Asp Phe Tyr Asp Glu Glu He Asn Glu Gly 
375 380 385 

gee ctg gtt aac aca tea gac ctt aat gaa gaa ctt ggt cag gtg gat 1735 
Ala Leu Val Asn Thr Ser Asp Leu Asn Glu Glu Leu Gly Gin Val Asp 
390 395 400 

tat gta ttt aca gat aag act. gga aca etc act gaa aac age atg gaa 1783 
Tyr Val Phe Thr Asp Lys Thr Gly Thr Leu Thr Glu Asn Ser Met Glu 
405 410 415 420 
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ttc att gaa.tgc tgc a^a gat ggc cac aaa tat aaa ggt gta act caa 1831 
Phe lie Glu Cys Cys ^le Asp iGly His Lys Tyr Lys Gly Val Thr Gin ■ 
425 430 435 . 

• • » ' ' s , r 

gag gtt gat gga tta tct caa act gat gga act tta aca tat ttt gac 1879 
Glu Val Asp 1 Gly Leu Ser Gin Thr Asp Gly Thr Leu Thr Tyr Phe Asp 

440 445 ■'!■' 450 

aaa gta gat aag aat cga gaa gag ctg ttt eta cgt gee ttg tgt tta 1927 
Lys Val Asp Lys Asn Arg Glu Giu Leu Phe Leu Arg >Ala Leu Cys Leu t 
455 460 , 465 

i . 
tgt cat act gta gaa ate aaa aca aac gat get gtt gat gga get aca 1975 
Cys His Thr Val 'Glu He Lys Thr Asn Asp Ala Val Asp Gly Ala Thr 
470 i . 4-75 480 

• i . " ■ ■ 

gaa tea get gaa tta acc tat ate tec tct tea cca gat gaa ata get 2023 
Glu Ser Ala Glu Leu Thr Tyr He Ser Ser Ser Pro Asp Glu He Ala 
485 4 90 495 500 

ttg gtg aaa gga get aaa agg tac ggg ttc aca ttt tta gga aat cga 2071 
Leu Val Lys Gly Ala Lys Arg Tyr Gly Phe Thr Phe Leu Gly Asn Arg 
505 , ♦ 510 515 

aat gga tat atg aga gta gag aac caa aga aaa gaa ata gaa gaa tat! 2119 
Asn Gly Tyr Met Arg 'Val Glu Asn Gin Arg Lys Glu lie <?lu Glu Tyr 
520 525 530 

gaa ctt ctt cac acc tta aac ttt gat get gtc egg cga cgt atg agt 2167 
Glu Leu Leu His Thr Leu Asn Phe Asp Ala Val Arg Arg Arg Met Ser 
535 540 545 

gta att gtg aag act caa gaa gga gac ata ctt etc ttt tgt aaa gga 2215 
Val He Val Lys Thr Gin Glu Gly Asp He Leu Leu Phe Cys Lys Gly 
550 555 560 

gca gac teg gca gtt ttt ccc aga gtg caa aat cat gaa att gag tta 2263 
Ala Asp Ser Ala Val Phe Pro Arg Val Gin Asn His, Glu He Glu Leu 
565 - 570 575 580 

act aaa gtc cat gtg gaa cgt aat gca atg gat ggg tat egg aca etc 2311 
Thr Lys Val His Val Glu Arg Asn Ala Met Asp Gly Tyr Arg Thr. Leu 
585 5 90 595 

tgt gta gee ttc aaa gaa att get cca gat gat tat gaa aga att aac 2359 
Cys Val Ala Phe Lys Glu lie Ala Pro Asp Asp Tyr Glu Arg He Asn 
600 605 610 

aga cag etc ata gag gca aaa atg gee tta caa gac aga gaa gaa aaa 2407 
Arg Gin Leu He Glu Ala Lys Met Ala Leu Gin Asp Arg Glu Glu Lys 
615 620 625 

atg gaa aaa gtt ttc gat gat att gag aca aac atg aat tta att gga 2455 
Met Glu Lys Val Phe Asp Asp He Glu Thr Asn Met Asn Leu He Gly 
630 635 640 

gee act gca gtt gaa gac aag eta caa gat caa get gca gag acc att 2503 
Ala Thr Ala Val Glu Asp Lys Leu Gin Asp Gin Ala Ala Glu Thr lie 
645 650 655 660 
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gaa get ctg cat gca gca ggc ctg aaa gtc tgg gtg etc act ggg gac 2552 

Glu Ala Leu His Ala , Ala Gly » Leu "Lys Val Trp Val Leu Thr Gly Asp* 

665 670 675 ■ 

• • ' . • , r 

aag atg gag aca get aaa tec aca tgc ,t,at gec tgc cgc ctt ttc cag 2599 

Lys Met Glu* Thr Ala Lys Ser Thr Cys Tyr Ala Cys Arg Leu Phe Gin 

680 685 690 

acc aac act gag etc tta gaa eta acc aca aaa ace att gaa gaa agt 2647 

Thr Asn Thr Glu Leu Leu Glu Leu Thr Thr Lys Thr • lie Glu Glu Ser , 

695 f 700 , 705 

i ' ■• ' • • • 

gaa agg aaa gaa gat cga tta cat gaa tta ttg ata gaa tat cgc aag 2695 

Glu Arg Lys Glu 'Asp Arg Leu His Glu Leu Leu lie Glu Tyr Arg Lys 

710 t • 715 720 

■ ' • 

aaa ttg ctg cat gag ttt cct aaa agt act aga age ttt aaa aaa gca 2743 

Lys Leu Leu His Glu Phe Pro Lys Ser Thr Arg Ser Phe Lys Lys Ala 

725 730 735 740 

tgg aca gaa cat cag gaa tat gga tta ate ata gat ggc tec aca ttg 2791 

Trp Thr Glu His Gin Glu Tyr Gly Leu lie lie Asp Gly Ser Thr Leu 

745 , . : . 750 755 

tea etc ata ct,a aat tct agt caa gac tct agt tea aac aat tac aaa 2839 

Ser Leu lie Leu Asn 'Ser Ser Gin Asp Ser Ser Ser Asn Asn Tyr Lys 

7 60 7 65 770 

age att ttc eta caa ata tgt atg aag tgt act gea gtg etc tgctgt 2887 

Ser lie Phe Leu Gin lie Cys Met Lys Cys Thr Ala Val Leu Cys Cys 

775 780 785 

egg atg gca cca tta cag aaa gee cag att gtc aga atg gtg aag aat 2935 

Arg Met Ala Pro Leu Gin Lys Ala Gin lie Val Arg Met Val Lys Asn 

790 795 800 

tta aaa ggc age cca ata act ctg teg ata ggt gat ggt gee aat gat 2983 

Leu Lys Gly Ser Pro lie Thr Leu Ser lie Gly Asp. Gly Ala Asn Asp 

805 810 815 820 

gtt agt atg ate ttg gaa tec cat gtg gga ata ggt att aaa ggc aaa 3031 

Val Ser Met lie Leu Glu Ser His Val Gly lie Gly lie Lys Gly Lys 

825 830 835 

gaa ggt cgc caa gca get agg aat age gat tat tct gtt cca aag ttt 307 9 

Glu Gly Arg Gin Ala Ala Arg Asn Ser Asp Tyr Ser Val Pro Lys Phe 

840 845 850 

aaa cac tta aag aaa ctg ctg ttg get cat gga cat eta tat tat gtg 3127 

Lys His Leu Lys Lys Leu Leu Leu Ala His Gly His Leu Tyr Tyr Val 

855 860 ~ 865 

aga ata gca cac ctt gta cag tac ttc ttc tat aag aac ctt tgt ttc 3175 

Arg lie Ala His Leu Val Gin Tyr Phe Phe Tyr Lys Asn Leu Cys Phe 

870 875 880 

att ttg cca cag ttt ttg tac cag ttc ttc tgt gga ttc tea caa cag 3223 

lie Leu Pro Gin Phe Leu Tyr Gin Phe Phe Cys Giy Phe Ser Gin Gin 

885 890 895 900 



•38 



BNSDOCID: <WO„02055701 A2 J_> 



WO 02/055701 PCT/USOl/49060 



cca ctg tat gat get gpt tac ctt aca atg tac aat, ate tgc ttc aca 
Pro Leu Tyr Asp Ala Ala Tyr Leu Thr Met Tyf Asn lie Cys Phe Thr 
905 910 915 



3271 



tec ttg ccc ate ctg gec tat agt eta ctg gaa cag cac ate aac att 33.19. 
Ser Leu Pro lie Leu Ala Tyr Ser Leu Leu Glu Gin His lie Asn lie 
920 925 930 

gac act ctg ace tea gat ccc cga ttg tat atg aaa att tct ggc aat 3367 

Asp Thr Leu Thr Ser Asp Pro Arg Leu Tyr, Met Lys lie Ser Gly Asn 
935 940 945 

. , ' ' " 

gee atg eta cag ttg ggc ccc ttc tta tat tgg aca ttt ctg get gee 3415 

Ala Met Leu Gin Leu Gly Pro Phe Leu Tyr Trp Thr Phe Leu Ala Ala 
950 95i5 960 . 

'tt.t gaa ggg aca gtg ttc ttc ,ttt ggg act tac ttt ctt ttt cag act 34 63 
Phe Glu Gly Thr Val Phe Phe Phe Gly Thr Tyr Phe Leu Phe Gin Thr 
965 * - 970 975 980 

1 gca tec eta gaa gaa aat.gga aag gta tac gga aac tgg act ttt gga 3511 
Ala Ser Leu Glu Glu Asn Gly Lys Val Tyr Gly Asn Trp Thr Phe Gly 
985 990 995 

ace att gtt ttt aca gtc tta gta ttc act gta acc ctg aag ctt gec 3559. 
Thr lie Val Phe Thr Val Leu Val Phe Thr Val Thr Leu Lys Leu Ala 
1000 1005 • 1010 

ttg gat acc cga ttc tgg acg tgg ata aat cac ttt gtg att tgg ggt 3607 
Leu Asp Thr Arg Phe Trp Thr Trp lie Asn His Phe Val lie Trp Gly 
1015 1020 1025 

tct tta gee ttc tat gta ttt ttc tea ttc ttc tgg gga gga att att 3655 
Ser Leu Ala Phe Tyr Val Phe Phe Ser Phe Phe Trp Gly Gly lie lie 
1030 1035 1040 



tgg cct ttt etc aag caa cag aga atg tat ttt gta ttt gec caa atg 
Trp Pro Phe Leu Lys Gin Gin Arg Met Tyr Phe Val Phe Ala Gin Met 
1045 1050 1055 1060 



3703 



ctg tct tct gta tec aca tgg ttg get ata att ctt eta ata ttt ate 3751 
.Leu Ser Ser Val Ser Thr Trp Leu Ala lie lie Leu Leu lie Phe lie 
1065 1070 1075 

age ctg ttc cct gag att ctt ctg ata gta tta aag aat gta aga aga 37 99 
Ser Leu Phe Pro Glu lie Leu Leu lie Val Leu Lys Asn Val Arg Arg 
1080 1085 1090 

aga agt gec agg aga aat ctg age tgt aga agg gca tct gac tea tta 3847 
Arg Ser Ala Arg Arg Asn Leu Ser Cys Arg Arg Ala Ser Asp Ser Leu 
1095 1100 1105 

tec gec aga cct tea gtc aga cct ctt ctt tta cga aca ttc tea gac 3895 
Ser Ala Arg Pro Ser Val Arg Pro Leu Leu Leu Arg Thr Phe Ser Asp 
1110 " 1115 1120 

gaa tct aat gta ttg taa cagaatccga atcttgaact gcctatgtta 3943 

Glu Ser Asn Val Leu * 

1125 



-39- 



BNSDOCID: <WO__02055701A2J_> 



WO 02/055701 



PCT/US01/49060 



ttgtcctaca 
.„ acaaaaagtt 
gagttcac aa 



agcatactga ; 

atcatctcag 

taaatgttca 



cagtggttac 
gatacttgat 
ttaaaatacc 



agctaaaaaa 
atgcaacaca' 
aaatgattct 



gaaagcatga 
ctaaaccact 
cttaagcatt 



agaaacaact 
ctcatgtcta 
taccattatt 



4003 
4063 
4123 
4183 
4243 
4303 
4363 
4423 
4483 
4543 
4603 
4663 
.4723 
•4783 
4843 
4903 
4 963 
5023 
5083 
5143 
5203 
5263 
5323 
5383 
5443 
5503 
5563 
5623 
5683 
57 4 3 
5803 
5863 
5923 
5983 
6043 
6103 
6163 
6223 
6283 
6343 
6403 
6463 
6523 
6582 



gtaagtagcc 
atactta,gaa 
atgaactttg 
gaatagattt 
catttctcat 
ttgaaaatac 
gtttacccat 
gcttatgtgg 
tttttattct 
agtataaaac 
tttccagtgt 
aataaaggtc 
cttattttaa 
'tgctttttct 
tcagcttgta 
atggtctaac 
aggaaggttt 
1 atgttaaaca 
tgtttttttt 
atttggaaaa 
attcttccac 
agaacataat 
atttgctctc 
cctaagtgtg 
ctatattctt 
tttgtgtttg 
taaaatctat 
gcagtaccat 
attccttcca 
agctacacat 
catggcctag 
gctaaaaact 
ttcctagcac 
gatcatctac 
accgctgtcc 
cttacacata 
ttagtgatgg 
acaaaatcat 
.ttttttacat 
aacttgcctt 
aaaaaaaaaa 



tttatggcca 

ttctggcttt 

tgaatggatt 

tgctaatact 

tcctcctaaa 

agaataacta 

gtgctactaa 

aaagtgttaa 

tctgaattaa- 

atttccaatc 

tgattttact 

attctgaata 

acttttgacg 

taaacttggc 

gagccatgac 

tc.taaa.cc.at 

agccttctaa 

catacttttt 

ttgtctttct 

gtattggagt 

atatattttg 

ttcagttatc 

atatatcagt 

ctgtgttctg 

tgcctattcc 

ttttcttgtt 

gggaactgtg 

tggagagtta 

gccagaaaga 

tacattgtga 

atccttgaga 

ggccagtgtg 

tcaatgtaat 

ctataatttg 

attgtttgag 

caccagcaac 

aattttttaa 

ttttcagctt 

ctacgttgtt 

ttccattaca 

aaaaaaaaaa 



aagctgtaag. 

agttagagta 

taaatacaat 

actttttttg 

tgatcatctt 

aatagtatca 

tgtcttggta 

qttacgggta 

accccactta 

acttagtagc 

gggatctgca 

tcagcctttt 

gtttttattt 

caatttcfgaa 

tctgtgaaga 

gtaactgacc 

ttagttcact 

tggatataaa 

ggcatgcctg 

taatctgtat 

tgttaatgtt 

agaagccatt 

attatttttg 

gtagcccgta 

aaagagctaa 

cttatggttg 

aatacagaca 

acagaataat 

aagaaagaca 

tactgcagct 

gggacctggc 

gggttagcag 

caccctattt 

aataattttg 

gatgttacct 

agtataaatg 

taacatgcag 

gcaaaatggg 

gctttctaaa 

gaaatttttg 

aaaaaaaaaa 



ttaagaatta, 

atataactca 

agtatgaagt- 

cctggcagaa 

aatttttttt 

atgcatcaga 

gaatattctt 

tttttgtggg 

tgggtgtaag 

ccctcaaagt 

gtaaggtggt 

ataattttat 

gaaagagatt 

agggggaagg 

tgaatgttgt 

ttagtaaagt 

tgaaacataa 

tgaccatatt 

tactattatt 

tatatttata 

taggtatgat 

ccatcattat 

agcattttgt; 

tttgaggtaa 

aaaagtctaa 

ttgttgctgt 

agagagccac 

ctagtagaaa 

aggagtaagg 

caaattcaga 

tttccttttt 

atcgaataac 

gtgacagaga 

agctatcaaa 

actaaactga 

taagcctaac 

tatataaatg 

actgcaatat 

atgaatgtga 

tttgatgtaa 

aaaaaaaaaa 



tatgaaagtt 

aatgggtgct 

agaagttatg 

gaaatagact 

cccaagtaca 

cagaatagtt 

gccaaaaaaa 

aatagaaaaa 

cctactagac 

agttagaaaa 

ttaaaccata 

gtgaagagga 

gcatttatgc 

agccacccca 

ctcttaactt 

ccttgactaa 

atgtgaaatg 

tatttgactg 

aatgtttata 

tagtccatat 

ttttttctaa 

agaccctttt 

tacatgtcat 

gctgotgaaa 

cccaggaaag 

attatgattg 

agtagagagg 

aataactggt 

gggatttaga 

atggcaatga 

aaaagatatt 

ttgaaataga 

aagggaaaaa 

atgtctttgt 

aaacattcat 

tttgcaaaat 

tgcagatttt 

tacatttttc 

atgccatctt 

tcaataaact 

aaaaaaaaaa 



gaaagcaaga 
cttttaaccc 
caatgagaat 
atttggatca 
taaggaatac 
aatcccttct 
taccttgaaq 
aattgtttat 
ttgaaaataa 
taaacagatt 
gttatataaa 
agaaatatag 
atatatgcag 
aaacggtggt 
ggacagggaa 
ctgaactaga 
tcttcattca 
ctagtttttt 
ttgtaccttg 
ggcacatttg 
attctagaaa 
tcattatttc 
tcacaactta 
acaaaagtct 
cttttgatat 
ctg.ttttaca 
cttgtttaat 
tgcatgtaaa 
gttatgtctc 
tacatgatat 
ttactgaaga 
ccgtgcagta 
aatataataa 
aattttcaca 
tccatatcta 
tcgtaataat 
atgcgtgttg 
acttaagcag 
ttatgactgc 
ttggtatgat 
aaaaaaaaa* 



<210> 14 
<211> 1129 
<212> PRT 

<213> Homo sapiens 
<400> 14 

Met Phe Arg Arg Ser Leu Asn Arg Phe Cys Ala Gly Glu Glu Lys Arg 

1 5 10 15 

Val Gly Thr Arg Thr Val Phe Val Gly Asn His Pro Val Ser Glu Thr 

20 25 30 

Glu Ala Tyr lie Ala Gin Arg Phe Cys Asp Asn Arg He Val Ser Ser 

35 40 45 

Lys Tyr Thr Leu Trp Asn Phe Leu Pro Lys Asn Leu Phe Glu Gin Phe 

50 55 60 

Arg Arg He Ala Asn Phe Tyr Phe Leu lie lie Phe Leu Val Gin Val 
65 70 75 80 
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x Hi 


Val 


Asp 


Thr 


Pro 


Thr 


Ser 


Pro 


Val 


Thr Ser Gly Leu 


Pro 


Leu 


Phe 










85 


Thr 






90 








* 


95 






Val 


He 


Thr 


Val' 


Ala 


He 


Lys Gin Gly Tyr Glu Asp Trp Leu 








1UU 










iut> 






'thr 




rro- 


lie 


Am 


His Airg Ala Asp Asn 


Glu 


Val 


Asn Lys 


Ser 


Val 


Tvr 
x y.L 


lie' 






115. 










120 






125 








Glu 


Asn 


Ala 


Lys 


Arg 


Val 


Arg Lys Glu 


Ser Glu 


Lys lie 


T 


Val 


Gly 




130 










135 










140 








Asp 


Val 


Val 


Glu 


Val 


Gin 


Ala 


Asp 




Thr 
x nx 


rr i its 


Pro 


Cys 


Asp 


Leu 


He 


x. i «j 










150 










155 


i 






160 




Leu 


Ser 


Ser 


Cys 


Thr 


Thr Asp 


Gly 


Thr 

X 11JL 


^ys ( 


Tyr 


Val 


Thr 


Thr 

X 1 IX 


Ala 










165 


V 






1 


170 


» 










Ser 


Leu Asp Gly Glu 


Ser 


Asn Cys 


Lys 


Thr 
X IlX 


U -i o 

nxs 


Tyr Ala 


Val 


Arg 


Asp 








180 » 








185 










i on 






Thr 

1 Hi 


lie 


Ala 


Leu 


Cys Thr 


Ala 


Glu 


Ser 


lie 


Asp 


Thr 


Leu 


Am 


Al a 


Ala 






195 










200 










205 








Tip 


Glu 
210 


Cys 


Glu 


Gin' 


Pro 


Gin 
215 


Pro 


Asp 


Leu 


Tyr 


Lys 
220 


Phe 


V ClJ. 




Arg 


T1p 


Asn 


lie 


Tyr 


Ser 


Asn 


Ser 


Leu 


Glu 


AJ.a 


val 


Ala 


Arg 


Ser 


Leu 


Gly 


Z £D 










230 










OIK 










240 


pro 


Glu Asn 


Leu 


Leu 


Leu 


Lys 


Gly Ala 


m v-» v 

Thr 


Leu 


Lys Asn 


Thrr 

x nii 




Lys 










245 










250 










Zoo 




T 1 o 


Tyr 


Gly Val .Ala 


Val 


Tyr Thr 

i 


Gly 


Met 


Glu 


Thr 


Lys 


i v jen 




Leu 








260 






265 










O "7 f\ 






noil 


Tyr Gin 


Gly 


Lys 


Ser 


Gin 


Lys 


Arg 


Ser 


Ala 


Val' Glu 


*Lys 


Ser 


He 






275 


i 








280 










285 










Ala 


Phe 


Leu 


lie 


Val 


Tyr 


Leu 


Phe 


lie 


Leu 


Leu 


Thr 


Lys 


Ala 


Ala 




290 










295 










300 








Val 


Cys 


Thr 


Thr 


Leu 


Lys 


Tyr 


Val 


Trp 


Gin 


Ser 


Thr 


Pro 


Tyr 


TV «■« 

Asn 


Asp 












310 










315 










320 


r* l in 

olU 


Pro 


Trp 


Tyr 


Asn 


Gin 


Lys 


Thr Gin 


Lys 


Glu 


Arg 


Glu 


Thr 


Leu 


Lys 










325 










330 














Vdl 


Leu 


Lys 


Met 
340 


Phe 


Thr 


Asp 


Phe 


Leu 
345 


Ser 


Phe 


Met 


Val 


Leu 

Ten 


rne 


Asn 


rue 


He 


lie 


Pro 


Val 


Ser 


Met Tyr 


Val 


Thr 


Val 


Glu 


Met 


Gin 


Lys 


Phe 






355 










360 










365 






xjc vx 


Gly 


Ser 


Phe 


Phe 


He 


Ser Trp Asp 


Lys Asp 


Phe 


Tyr 


Asp 


C\ tn 

LiXU 


Glu 




370 










375 










380, 










He 


Asn 


Glu 


Gly Ala 


Leu 


Val 


Asn 


Thr 


Ser 


Asp 


Leu 


ASn 


pi 

Vc>XU 


bxll 


Leu 


■ape 

JO J 










390 










395 










400 


vjiy 


Gin 


Val 


Asp 


Tyr 


Val 


Phe 


Thr 


Asp 


Lys 


Thr 


Gly 


Thr 


Leu 


i nr 


Glu 










405 










410 








A 1 K 

H 1 0 




Sen 
no 1 1 


Ser 


Met 


Glu 
4 20 


Phe 


He 


Glu 


Cys 


Cys 
425 


He 


Asp 


Gly 


His 


Lys 

*i 3 u 


iyr 


Lys 




Val 


Thr 


Gin 


Glu 


Val 


Asp 


Gly 


Leu 


Ser Gin 


Thr Asp 




Thr 
X IlX 


Leu 






435 










440 










4 45 








Thr 

X 111. 


Tyr 


Phe 


Asp 


Lys 


Val 


Asp 


Lys 


Asn 


Arg Glu 


Glu 


Leu 


Ph p» 

rnc 


Lieu 


Arg 




450 










455 










4 60 










Ala 


Leu 


Cys 


Leu 


Cys 


His 


Thr 


Val 


Glu 


He. 


Lys 


Thr 


Asn 


Asp 


al'- 
nld 


Val 


" \j *j 










470 










475 










480 




Gly Ala 


Thr 


Glu 


Ser 


Ala 


Glu 


Leu 


Thr 


Tyr 


He 


Ser 




Cor 


Pro 










485 










490 










H^O 






Glu 


He 


Ala 


Leu 


Val 


Lys 


Gly Ala 


Lys Arg 


Tyr 


Gly 


Phe 


Thr 


Phe 








500 










505 










510 






Leu 


Gly Asn Arg Asn Gly 


Tyr 


Met 


Arg 


Val 


Glu 


Asn 


Gin 


Arg 


Lys 


Glu 






515 










520 










525 








He 


Glu 


Glu 


Tyr 


Glu 


Leu 


Leu 


His 


Thr 


Leu 


Asn 


Phe 


Asp 


Ala 


Val 


Arg 




530 










535 










540 








Arg 


Arg 


Met 


Ser 


Val 


He 


Val 


Lys 


Thr 


Gin 


Glu 


Gly Asp 


He 


Leu 


Leu 


545 










550 










555 










560 
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Phe Cys Lys Gly Ala Asp Ser Ala Val Phe Pro Arg Val Gin Asn His 

565 ^ M . „ 570 , ■ . , 575 , 

Glu lie Glu Leu Thr *Lys Val His Val Glu A*g Asn Ala Met Asp Gly 

— — 5F0" — ; ; j — 583 : ; — s-^tr- f 

Tyr Arg Thr Leu Cys Val Ala Phe Lys Glu lie Ala Pro Asp Asp 'Tyr 

595. 600 605 

Glu Arg lie Asn Arg Gin Leu lie Glu Ala. Lys Met 'Ala Leu Gin Asp 

610 615 620 

Arg Glu Glu Lys Met Glu Lys Val Phe Asp Asp lie Glu Thr Ash Met 
625 630 635 , 640 

Asn Leu lie Gly Ala Thr Ala Val Glu Asp Lys Leu Gin Asp Gin Ala 

645 ' • , • 650 ' . 655 

Ala Glu Thr lie Glu Ala Leu His Ala Ala Gly Leu Lys Val Trp Val 

660 , 665 670 

Leu Thr Gly f Asp Lys Met Glu Thr Ala Lys Ser Thr Cys Tyr. Ala Cys 

675 68.0 685 

Arg Leu Phe Gin Thr Asn Thr Glu Leu Leu Glu Leu Thr Thr Lys Thr 

690 695 700 

lie Glu Giu Ser Glu Arg Lys Glu Asp Arg Leu His Glu Leu Leu lie 
705 710 715 720 

Glu Tyr Arg' Lys Lys Leu Leu His Glu Phe Pro Lys Ser Thr Arg Ser 

725 730 735 

Phe Lys Lys Ala Trp Thr Glu His Gin Glu Tyr Gly Leu lie lie Asp 

740 . ' 745 750. 

Gly Ser Thr Leu Ser Leu lie Leu Asn Ser* Ser Gin' Asp !ser Ser Seit 

755 ' ■ 760 765 

Asn Asn Tyr Lys Ser lie Phe Leu GXn lie Cys Met Lys Cys Thr Ala 

770 775 780 

Val Leu Cys Cys Arg Met Ala Pro Leu Gin Lys Ala Gin lie Val Arg 
785 ~ 790. 795 800 

Met Val Lys Asn Leu Lys Gly Ser Pro lie Thr Leu Ser lie Gly Asp 

805 ■ 810 * 815 , 

Gly Ala Asn Asp Val Ser Met lie Leu Giu Ser His Val Gly lie Gly 

820 825 830 

lie Lys Gly Lys Glu Gly Arg Gin Ala Ala Arg Asn Ser Asp Tyr Ser 

835 840 845 

Val Pro Lys Phe Lys His Leu Lys Lys Leu Leu Leu Ala His Gly His 

850 855 • 860 ( 

Leu Tyr Tyr Val Arg lie Ala His Leu Val Gin Tyr Phe Phe Tyr Lys 
865 870 ' 875 880 

Asn Leu Cys Phe He Leu Pro Gin Phe Leu Tyr Gin Phe Phe Cys Gly 

885 890 895 

Phe Ser Gin Gin Pro Leu Tyr Asp Ala Ala Tyr Leu Thr Met Tyr Asn 

900 905 910 

He Cys Phe Thr Ser Leu Pro He Leu Ala Tyr Ser Leu Leu Glu Gin 

915 920 925 

His He Asn He Asp Thr Leu Thr Ser Asp Pro Arg Leu Tyr Met Lys 

930 935 940 

He. Ser Gly Asn Ala Met Leu Gin Leu Gly Pro Phe Leu Tyr Trp Thr 
945 950 955 960 

Phe Leu Ala Ala Phe Glu Gly Thr Val Phe Phe Phe Gly Thr Tyr Phe 

965 970 975 

Leu Phe Gin Thr Ala Ser Leu- Glu Glu Asn Gly Lys Val Tyr Gly Asn 

980 985 990 

Trp Thr Phe Gly Thr lie Val Phe Thr Val Leu Val Phe Thr Val Thr 

995 1000 1005 

Leu Lys Leu Ala Leu Asp Thr Arg Phe Trp Thr Trp lie Asn His Phe 

1010 1015 1020 

Val He Trp Gly Ser Leu Ala Phe Tyr Val Phe Phe Ser Phe Phe Trp 
1025 1030 1035 . 1040 
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Gly Gly lie lie Trp Pro Phe Leu I»ys Gin Gin Arg Met Tyr Phe Val 

104 5 1050- , 1055 

Phe Ala Gin Met Leu Ser Ser Val- Ser Thr Trp Leu Al a He He Leu 

: . .1060 : J065 ... ~ " ~r070 " " — 

Leu He Phe lie Ser Leu Phe Pro Glu He Leu Leu lie Val Leu Lys 

' 1075 1080 ■ 1085 

Asn Val Arg Arg Arg Ser Ala Arg Arg Asn Leu Ser Cys Arg Arg Ala 

1090 1095 1100 

Ser Asp Ser Leii Ser Ala Arg Pro Ser Val Arg Pro Leu Leu Leu Arg 
1105 1110 ( 1115. 1120 

Thr Phe Ser Asp Glu Ser . Asn Val Leu ' ' : ■ ■ ' 

1125 . ? , » ' 



<210> 


15 


<211> 


3390 


<212> 


DNA . 


<2i3> 


Homo 


<220> 




<221> 


CDS 


<222> 


(1) . . 


<400> 


15 


atg ttc cgc 


Met Phe Arg 


1 





. (3390) 



5 10 , 15 



4 8 



gtt ggc aca cgc aca gtg ttt gtt ggc aat cat cca gtt teg gaa aca 96 

Val Gly Thr Arg Thr Val Phe Val Gly Asn His Pro Val Ser Glu Thr 

20 25 30 

gaa get tac att gca caa aga ttt tgt gat aat aga ata gtc tea tct 144 

Glu Ala Tyr lie Ala Gin Arg Phe Cys Asp Asn Arg He Val Ser Ser 

35 40 45 

aag tat aca ctt tgg aat ttt etc cca aag aat ctg ttt gaa cag ttt 192 

Lys Tyr Thr Leu Trp Asn Phe Leu Pro Lys Asn Leu Phe Glu Gin Phe 

' 50 55 60 

aga aga att gca aat ttt tat ttt etc ata ate ttc ctt gta cag gtc 240 

Arg Arg He Ala Asn Phe Tyr Phe Leu He He Phe Leu Val Gin Val 

65 70 75 80 

aca gta "gac aca cca act age cca gtt ace agt gga ctt cca ctt ttc 288 

Thr Val Asp Thr Pro Thr Ser Pro Val Thr Ser Gly Leu Pro Leu Phe 

85 90 95 

ttt gtt ata act gtt aca gee ate aag cag gga tat gag gat tgg ctg 336 

the Val lie Thr Val Thr Ala He Lys Gin Gly Tyr Glu Asp Trp Leu 

100 105 110 

aga cac aga get gac aat gaa gtc aac aaa age act gtt tac att att 384 

Arg His Arg Ala Asp Asn Glu Val Asn. Lys Ser Thr Val Tyr He He 

115 120 125 

gaa aat gca aag cga gtg aga aaa gaa agt gaa aaa ate aag gtt ggt 432 

Glu Asn Ala Lys Arg Val Arg Lys Glu Ser Glu Lys He Lys Val Gly 

130 135 140 
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gat gta gta gaa gta cag gca gat gaa acc ttt ccc tgt gat ctt att 4 80 

Asp Val Val Glu Val Gin Ala ,Asp Glu T£r Phe Pro Cys Asp Leu I}e . 

145 'l50 155 160 

ctt eta tea tct tgc acc act gat gga acc tgt tat gtc act aca gec 528 

Leu Leu Ser » Ser Cys Thr Thr Asp Gly Thr Cys Tyr Val Thr Thr Ala . . 

165 170. 1 1 175 

agt ctt gat ggg gaa tec aat tgc aag aca cat tat gca gta cgt gat 576 

Ser Leu Asp Gly Glu Ser Asn Cys Lys Thr His Tyr ,Ala Val Arg Asp 

180 185 , 190 

1 i » , . . ' 
i 

acc att gca ctg tgt aca gca gaa tec ate gat acc etc cga gca gca 624 

Thr lie Ala Leu Cys Thr Ala Glu Ser lie Asp Thr Leu Arg Ala Ala 

195, , 200 205 " 
♦ " • 

att gaa tgt gaa ca"g cct caa cct gac etc tac aaa ttt gtt ggg cga 672 

lie Glu Cys Glu Gin Pro Gin Pro Asp Leu Tyr Lys Phe Val Gly Arg 

210 * ' 215 220 



ate aat ate* tac agt aat agt ctt gag get gtt gee agg tct ttg gga 
lie Asn lie Tyr Ser Asn Ser Leu Glu Ala Val Ala Arg Ser Leu Gly 
225 230 . ' 235 240 



720 



cct gaa aat etc ttg ctg aaa gga get acg eta aaa ' aat acc gag aac| 768 
Pro Glu Asn Leu Leu 'Leu Lys Gly Ala Thr Leu Lys Asn Thr Glu Lys 
245 , 250 255 

ata tat gga gtt get gtt tac act gga atg gaa acc aaa atg get ttg 816 

lie Tyr Gly Val Ala Val Tyr Thr Gly Met Glu Thr Lys Met Ala Leu 
260 265 270 

» 

aac tac caa ggg aaa tct cag aaa cgt tct get gtt gaa aaa tct att 864 

Asn Tyr Gin Gly Lys Ser Gin Lys Arg Ser Ala Val Glu Lys Ser lie 
275 280 285 

aat get ttc ctg att gta tat tta ttt ate tta ctg acc aaa get gca 912 
Asn Ala Phe Leu lie Val Tyr Leu Phe lie Leu Leu , Thr Lys Ala Ala 
290 295 300 * 

gta tgc act act eta aag tat gtt tgg caa agt acc cca tac aat gat 960 
Val Cys Thr Thr Leu Lys Tyr Val Trp Gin Ser Thr Pro Tyr Asn Asp 
305 310 315 320 

gaa cct tgg tat aac caa aag act cag aa'a gag cga gag acc ttg aag 1008 
Glu Pro Trp Tyr Asn Gin Lys Thr Gin Lys Glu Arg Glu Thr Leu Lys 
325 330 335 

gtt tta aaa atg ttc acc gac ttc eta tea ttt atg gtt eta ttc aac 1056 
Val Leu Lys Met Phe Thr Asp Phe Leu Ser Phe Met Val Leu Phe Asn 
340 345 350 

ttt ate att cct gtc tec atg tac gtc aca gta gaa atg cag aaa ttc 1104 
Phe He He Pro Val Ser Met Tyr Val Thr Val Glu Met Gin Lys Phe 
. 355 360 365 

ttg ggc tec ttc ttc ate tea tgg gat aag gac ttt tat gat gaa gaa 1152 
Leu Gly Ser Phe Phe He Ser Trp Asp Lys Asp Phe Tyr Asp Glu Glu 
370 375 380 
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att aat gaa gga gcc ctg gtt aac aca tea gac ctt aat gaa gaa ctt .1200 

lie Asn Glu Gly Ala Leu Val Asn Thr Ser Asp . Leu Asn Glu Glu Leu 

385 390- 395 400 

• * * *' i i ' • ' ■ 

ggt cag gtg gat tat gta ttt aca gat aag act gga aca etc act gaa 1248 

Gly Gin' Val Asp Tyr Val Phe Thr Asp Lys Thr- Gly Thr Leu Thr Glu 
: ~ 4 05 410 ( ' 415 

aac age atg gaa ttc att gaa tgc tgc ata gat ggc cac aaa tat aaa 1296 

Asn Ser Met Glu Phe lie Glu Cys Cys lie Asp Gly His Lys Tyr Lys 
420 425 ' • 430 

ggt gta act caa gag gtt gat gga tta tct caa act gat gga act tta . 1344 

Gly Val Thr Gin Glu Val Asp Gly Leu Ser Gin Thr Asp Gly Thr Leu 
435 ,4 40 445 

kca tat ttt gac aaa gta gat aag aat cga gaa gag ctg ttt eta cgt .1392 

Thr Tyr Phe Asp Lys Val Asp Lys Asn Arg Glu Glu Leu Phe Leu Arg 
450 * 455 460 

•gcc ttg tgt tta tgt cat act gta gaa ate aaa aca aac gat get gtt 1440 

Ala Leu Cys Leu Cys His Thr Val Glu lie Lys Thr Asn Asp Ala Val 

465 470 475 480 

gat gga get aca gaa tea get gaa tta ace tat ate tec tct tea cca 14.88 

Asp Gly Ala Thr Glu Ser Ala Glu Leu Thr Tyr lie Ser Ser Ser Pro 
485 490 i 495 

gat gaa ata get ttg gtg aaa gga get aaa agg tac ggg ttc aca ttt 1536 

Asp Glu lie Ala Leu Val Lys Gly Ala Lys Arg Tyr Gly Phe Thr Phe 
500 505 510 

tta gga aat cga aat gga tat atg aga gta gag aac caa aga aaa gaa 1584 

Leu Gly Asn Arg Asn Gly Tyr Met Arg Val Glu Asn Gin Arg Lys Glu 
515 " 520 525 

ata gaa gaa tat. gaa ctt ctt cac ace tta aac ttt gat get gtc egg 1632 

lie Glu Glu Tyr Glu Leu Leu His Thr Leu Asn Phe Asp Ala Val Arg 
530 535 540 

cga cgt atg agt gta att gtg aag act caa gaa gga gac ata ctt etc 1680 

Arg Arg Met Ser Val lie Val Lys Thr Gin Glu Gly Asp lie Leu Leu 

545 550 555 560 

ttt tgt aaa gga gca gac teg gca gtt ttt ccc aga gtg caa aat cat 1728 
Phe Cys Lys Gly Ala Asp Ser Ala Val Phe Pro Arg Val Gin Asn His 
565 .570 575 

gaa att gag tta act aaa gtc cat gtg gaa cgt aat gca atg gat ggg 177 6 
5lu lie Glu Leu Thr Lys Val His Val Glu Arg Asn Ala Met Asp Gly 
580 585 590 

tat egg aca etc tgt gta gcc ttc aaa gaa att get cca gat gat tat . 1824 
Tyr Arg Thr Leu Cys Val Ala Phe Lys Glu lie Ala Pro Asp Asp Tyr 
595 600 605 

gaa aga att aac aga cag etc ata gag gca aaa atg gcc tta caa gac 1872 
Glu Arg lie Asn Arg Gin Leu lie Glu Ala Lys Met Ala Leu Gin Asp 
610 615 620 
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aga gaa gaa aaa atg gaa aaa gtt ttc gat gat att gag aca aac atg .1920 

Arg Glu Glu Lys Met Glu Lys Val Phe Asp Asp lie Glu Thr Asri Met 

625 630 635 640 

aat tta att gga gcc act gca gtt gaa gac aag eta caa gat caa get 1968 

Asn Leu lie Gly Ala Thr Ala Val Glu Asp Lys Leu Gin Asp .Gin Ala 

645 650 , ' 655 

gca gag acc att gaa get ctg cat gca gca ggc ctg aaa gtc tgg gtg 2016 

Ala Glu Thr lie Glu Ala Leu His Ala Ala Gly Leu Lys Val Trp Val 

660 665 670 

etc act ggg gac aag atg gag aca get aaa. tec aca tgc tat gcc tgc. .2064 

Leu Thr Gly Asp Lys Met .Glu Thr Ala Lys Ser Thr Cys Tyr Ala Cys 

675 , 680 685 

cgc ctt ttc cag acc aac act gag etc tta gaa eta acc aca aaa acc 2112 

Arg Leu Phe Gin Thr Asn Thr Glu Leu Leu Glu Leu Thr Thr Lys Thr 

690 . 695 700 

att gaa gaa agt gaa agg aaa gaa gat cga tta cat gaa tta ttg ata 2160 

lie Glu Glu Ser Glu Arg Lys Glu Asp Arg Leu His Glu Leu Leu lie 

705 710 715 720 

gaa tat cgc aag aaa ttg ctg cat gag ttt cct aaa agt act aga age 2208 

Glu Tyr Arg Lys Lys Leu Leu His Glu Phe Pro Lys Ser Thr Arg Ser 

725 730 * i. 7 35 

ttt aaa aaa gca tgg aca gaa cat cag gaa tat gga tta ate ata gat 2256 

Phe Lys Lys Ala Trp Thr Glu His Gin Glu Tyr Gly Leu lie He Asp 

740 745 750 

ggc tec aca ttg tea etc ata eta aat tct agt caa gac tct agt tea 2304 

Gly Ser Thr Leu Ser Leu lie Leu Asn Ser Ser Gin Asp Ser Ser Ser 

755 7 60 7 65 

aac aat tac aaa age att ttc eta caa ata tgt atg aag tgt act gca 2352 

Asn Asn Tyr Lys Ser He Phe Leu Gin He Cys Met Lys Gys Thr Ala 

770 775 780 

gtg etc tgc tgt egg atg gca cca tta cag aaa gcc cag att gtc aga 24 00 

,Val Leu Cys Cys Arg Met Ala Pro Leu Gin Lys Ala Gin lie Val Arg 

785 790 795 800 

atg gtg aag aat tta aaa ggc age cca ata act ctg teg ata ggt gat 24 48 

Met Val Lys Asn Leu Lys Gly Ser Pro He Thr Leu Ser He Gly Asp 

805 810 815 

ggt gcc aat gat gtt agt atg ate ttg gaa tec cat gtg gga ata ggt 2496 
Gly Ala Asn Asp Val Ser Met He Leu Glu Ser His Val Gly He Gly 

820 825 830 

att aaa ggc aaa gaa ggt cgc caa gca get agg aat age gat tat tct 2544 
He Lys Gly Lys Glu Gly Arg Gin Ala Ala Arg Asn Ser Asp Tyr Ser 

835 840 845 

gtt cca aag ttt aaa cac tta aag aaa ctg ctg ttg get cat gga cat 2592 

Val Pro Lys Phe Lys His Leu Lys Lys Leu Leu Leu Ala His Gly His 

850 855 860 
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eta tat tat gtg aga ata gca cac ctt gta cag tac ttc ttc tat aag 2640* 

Leu Tyr Tyr Val Arg ^le Ala »His Leu Val Gin Tyr Phe Phe Tyr Lys ■ 

865 870 875 880 

i , r 

aac ctt tgt ttc att ttg cca cag ttt ttg tac cag ttc ttc tgt gga 2688 

Asn Leu Cys 'Phe lie Leu Pro Gin Phe Leu Tyr Gin Phe Phe Cys Gly 

885 890 . " ' 895 



ttc tea 
Phe Ser 



ate tgc 
lie Cys 



caa cag cca ctg tat gat get get tac ctt aca atg tac aat 
Gin Gin Pro Leu Tyr Asp Ala Ala Tyr Leu ,Thr Met Tyr Asn 
900 905 



i 

ttc aca tec ttg ccc ate ctg gee tat agt eta 
Phe Thr Ser Leu Pro lie Leu Ala Tyr Ser Leu 
915, , 920 925 



910 

ctg gaa cag 
Leu Glu Gin 



2736 



2784 



cac ate aac att gac act ctg ace tea gat ccc cga ttg tat atg aaa 
His lie Asn lie Asp Thr Leu Thr Ser Asp Pro Arg Leu Tyr Met Lys. 
930 935 ' 940 



2832 



att tct ggc aat gee atg eta cag ttg ggc ccc ttc.tta tat tgg aca 

lie Ser Gly Asn Ala Met Leu Gin Leu Glfy Pro Phe Leu Tyr Trp Thr 

945. 950 ( , : 955 960 

i ^ 

ttt ctg get gee ttt gaa ggg aca gtg ttc* ttc ttt ggg act tac ttt 1 

Phe Leu Ala Ala Phe Glu Gly Thr Val Phe Phe Phe Gly Thr Tyr Phe 

965 • 970 975 



2880 



2928 



ctt ttt cag act gca tec eta gaa gaa aat gga aag gta tac gga aac 
Leu Phe Gin Thr Ala Ser Leu Glu Glu Asn Gly Lys Val Tyr Gly Asn 
980 985 990 



2976 



tgg act ttt gga ace att gtt ttt aca gtc tta gta ttc act gta acc 
Trp Thr Phe Gly Thr lie Val Phe Thr Val Leu Val Phe Thr Val Thr 
995 1000 1005 



3024 



ctg aag ctt gee ttg gat acc cga ttc tgg acg tgg ata aat cac ttt 
Leu Lys Leu Ala Leu Asp Thr Arg Phe Trp Thr Trp ,Ile Asn His Phe 
1010 1015 1020 ' 



3072 



gtg att tgg ggt tct tta gee ttc tat gta ttt ttc tea ttc ttc tgg 
Val lie Trp Gly Ser Leu Ala Phe Tyr Val Phe Phe Ser Phe Phe Trp 
1025 ~ 1030 1035 1040 



3120 



gga gga att att tgg cct ttt etc aag caa cag aga atg tat ttt gta 
Gly Gly lie He Trp Pro Phe Leu Lys Gin Gin Arg Met Tyr Phe Val 
1045 1050 1055 



3168 



ttt gec caa atg ctg tct tct gta tec aca tgg ttg get ata att ctt 
Phe Ala Gin Met Leu Ser Ser Val Ser Thr Trp Leu Ala lie He Leu 
1060 1065 1070 



3216 



eta ata ttt ate age ctg ttc cct gag att ctt ctg ata gta tta aag 
Leu lie Phe He Ser Leu Phe Pro Glu He Leu Leu He Val Leu Lys 
1075 1080 1085 



3264 



aat gta aga aga aga agt gee agg aga aat ctg age tgt aga agg gca 
Asn Val Arg Arg Arg Ser Ala Arg Arg Asn Leu Ser Cys Arg Arg Ala 
1090 1095 1100 



3312 
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tct gac tea tta tec gec aga cct-t'ca'-gtc aga cct ctt ctt tta cga 
Ser Asp Ser Leu Ser ,Ala Arg i«Pro Ser "Val 'Arg Pro Leu Leu Leu Arg ■* 
1105 1110 ■ 1115 ,.. 1120 



3360 



aca ttc tea gac gaa tct aat gta ttg £aa 
Thr Phe Ser' Asp Glu Ser Asn Val Leu * 
1125 



3390 



<210> 16 

<211> 6073 

<212> DNA 

<213> Homo sapiens 



<220> i 
<221> CDS 
<222> (273) . 



(4553) 



<400> 16 

ccacgcgtcc 

ctggctcccg 

aatggctggc 

ttgtaccttg 

ctacacaacc 



gggaggagcg gagggagaag taggttgega" gctcagcaca ggctccggcg 60 

cagctgagtt tgggagatgt ctaagtgatt tttttttttc 1 ceggaaggea 120 

gtggaagcac aacccgcttt cactcttcga atttgtgctt agctcttttc 180 

cgactegtga ccaacatgct gtgatgtgtig ccgagggagg aattggtcag 24 0 

tggatcttac cac&gtttgg at atq act gag get ptc caa tgq 293 

Met Thr Glu' Ala Leu Gin Trp 
1 ,5 



gee aga tat cac tgg cga egg ctg ate aga ggt gca ace agg gat gat 341 
Ala Arg Tyr His Trp Arg Arg Leu lie Arg Gly Ala Thr Arg Asp Asp 
10 15 20 



gat tea ggg cca tac aac tat tec teg ttg etc gee ,tgt ggg cgc aag 
Asp Ser Gly Pro Tyr Asn Tyr Ser Ser Leu Leu Ala Cys Gly Arg Lys 
25 30 35 



389 



tec tct cag ate cct aaa ctg tea gga agg cac egg att gtt gtt ccc 
Ser Ser Gin lie Pro Lys Leu Ser Gly Arg His Arg lie Val Val Pro 
40 45 50 55 



437 



cac ate cag ccc ttc aag gat gag tat gag aag ttc tec gga gec tat 
His lie Gin Pro Phe Lys Asp Glu Tyr Glu Lys Phe Ser Gly Ala Tyr 
60 65 70. 



485 



gtg aac aat cga ata cga aca aca aag tac aca. ctt ctg aat ttt gtg 
Val Asn Asn Arg lie Arg Thr Thr Lys Tyr Thr Leu Leu Asn Phe Val 
75 80 85 



533 



cca aga aat tta ttt gaa caa ttt cac aga get gee agt tta tat ttc 
Pro Arg Asn Leu Phe Glu Gin Phe His Arg Ala Ala Ser Leu Tyr Phe 
90 95 100 



581 



ctg ttc eta gtt gtc ctg aac tgg gta cct ttg gta gaa gee ttc caa 
Leu Phe Leu Val Val Leu Asn Trp Val Pro Leu Val Glu Ala Phe Gin 
105 110 115 



629 



aag gaa ate ace atg ttg cct. ctg gtg gtg gtc ctt aca att ate gca 
Lys Glu lie Thr Met Leu Pro Leu Val Val Val Leu Thr lie lie Ala 
120 125 130 135 



677 
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att aaa gat ggc ctg gaa gat tat egg aaa tac aaa att gac aaa cag .725 
, lie Lys Asp Gly Leu Glu Asp Tyr Arg Lys Tyr- Lys lie Asp Lys Gin 

140 . 145 150 

ate aat ( aat tta ata act aaa gtt tat agt agg aaa gag aaa aaa tac . 773 
lie Asn Asn Leu lie Thr Lys Val Tyr Ser Arg Lys Glu Lys .Lys Tyr 
155 160 . 165 

att gac cga tgc tgg aaa gac gtt act gtt ggg gac ttt att cgc etc 821 
lie Asp Arg Cys Trp Lys Asp Val Thr, Val Gly Asp Phe lie Arg Leu 

170 ~ 175 180 ' 

f * • 

• • i 1 

tec tgc aat gag gtc ate cct gca gac atg gta eta etc ttt tec act .869 
Ser Cys Asn Glu Val lie Pro Ala Asp Met Val Leu Leu Phe Ser Thr 
185 190 195 

fc(a£ cca gat gga ate tgt cac att gag act tct ggt ctt gat gga gag 917 
Asp Pro Asp Gly lie Cys His He Glu Thr Ser Gly Leu Asp Gly Glu 
200 * 205 210 ' 215 

'age aat tta aaa cag agg cag gtg gtt egg gga tat gca gaa cag gac 965 
Ser Asn Leu Lys Gin Arg Gin Val Val Arg Gly Tyr Ala Glu Gin Asp 
220 225 230 . 

tct gaa gtt gat cct gag aag ttt tec agt agg ata gaa tgt gaa age 1013 
Ser Glu Val Asp Pro Glu Lys Phe Ser Ser Arg ; lie Glu Cys Glu Ser 
235 240 i. 245 

cca aac aat gac etc age aga ttc cga ggc ttc eta gaa cat tec aac 1061 
Pro Asn Asn Asp Leu Ser Arg Phe Arg Gly Phe Leu Glu His Ser Asn 
250 ■ 255 260 

aaa gaa cgc gtg ggt etc agt aaa gaa aat ttg ttg ctt aga gga tgc 1109 
Lys Glu Arg Val Gly Leu Ser Lys Glu Asn Leu Leu Leu Arg Gly Cys 
265 ~ 270 "* 275 

ace att aga aac aca gag get gtt gtg ggc att gtg gtt tat gca ggc 1157 
Thr lie Arg Asn Thr Glu Ala Val Val Gly He Val Val Tyr Ala Gly 
280 285 290 295 

cat gaa ace aaa gca atg ctg aac aac agt ggg cea egg tat aag cgc 1205 
His Glu Thr Lys Ala Met Leu Asn Asn Ser Gly Pro Arg Tyr Lys Arg 
300 305 310 

age aaa tta gaa aga aga gca aac aca gat gtc etc tgg tgt gtc atg 1253 
Ser Lys Leu Glu Arg Arg Ala Asn Thr Asp Val Leu Trp Cys Val Met 
315 320 325 

ctt ctg gtc ata atg tgc tta act ggc gca gta ggt cat gga ate tgg 1301 
Leu Leu Val lie Met Cys Leu Thr Gly Ala Val Gly His Gly He Trp 
330 335 340 

ctg age agg tat gaa aag atg cat ttt ttc aat gtt ccc gag cct gat. 1349 
Leu Ser Arg Tyr Glu Lys Met His Phe Phe Asn Val Pro Glu Pro Asp 
345 350 355 

gga cat ate ata tea cca ctg ttg gca gga ttt tat atg ttt tgg ace 1397 
Gly His He He Ser Pro Leu Leu Ala Gly Phe Tyr Met Phe Trp Thr 
360 365 370 375 
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i 

atg ate att ttg tta cag gtc ttg att cct att tct etc tat gtt tec 1445 
Met lie He Leu Leu Gin Val Leu He Pro lie* Ser Leu Tyr Val Ser 
380 . 385 390 

■ ' • » » • • 

ate gaa, att gtg aag ctt gga caa ata tat ttc att caa agt gat gtg 1493 . 

He Glu He Val Lys Leu Gly Gin He Tyr Phe lie Gin Ser Asp Val 

395 400 , 405 

gat ttc tac aat gaa aaa atg gat tct att gtt cag tgc cga gec ctg 1541 
Asp Phe Tyr. Asn Glu Lys Met Asp Ser lie, Val Gin. Cys Arg Ala Leu 

410 , 415 420 ' 

■ ' • i » ' • 

■ aac ate gec gag gat ctg gga cag att cag tac etc ttt tec gat aag .1589 

Asn lie Ala Glu Asp Leu Gly Gin He Gin Tyr Leu Phe Ser Asp Lys 
425 . 4 3i0 435 

aca gga acc etc act gag aat aag atg gtt ttt cga aga tgt agt gtg 1637 
Thr Gly Thr Leu Thr Glu Asn Lys Met Val Phe Arg Arg Cys Ser Val 
440 ~ " 445 450 455 

' gca gga ttt gat tac tgc cat gaa gaa aat gee agg agg ttg gag tec 1685 
Ala Gly Phe Asp Tyr Cys His Glu Glu Asn Ala Arg Arg Leu Glu Ser 
460 465 470 

tat cag gaa get gtc tct gaa gat gaa gat ttt ata gac aca gtc agt 1733. 
Tyr Gin Glu Ala Val Ser Glu Asp Glu Asp Phe lie Asp Thr Val Ser 
475 480 • 485 

ggt tec etc age aat atg gca aaa ccg aga gee ccc age tgc agg aca 1781 
Gly Ser Leu Ser Asn Met Ala Lys Pro Arg Ala Pro Ser Cys Arg Thr 
490 4 95 500 

gtt cat aat ggg cct ttg gga aat aag ccc tea aat cat ctt get ggg 1829 
Val His Asn Gly Pro Leu Gly Asn Lys Pro Ser Asn His Leu Ala Gly 
.505 510 515 

age tct ttt act eta gga agt gga gaa gga gec agt gaa gtg cct cat 1877 
Ser Ser Phe Thr Leu Gly Ser Gly Glu Gly Ala Ser Glu Val Pro His 
520 525 530 535 

tec aga cag get get ttc agt age ccc att gaa aca gac gtg gta cca 1925 
.Ser Arg Gin Ala Ala Phe Ser Ser Pro lie Glu Thr Asp Val Val Pro 
540 545 550 

gac acc agg ctt tta gac aaa ttt agt cag att aca cct egg etc ttt 1973 
Asp Thr Arg Leu Leu Asp Lys Phe Ser Gin He Thr Pro Arg Leu Phe 
555 560 565 

atg cca eta gat gag acc ate caa aat cca cca atg. gaa act ttg tac 2021 

Met Pro Leu Asp Glu Thr lie Gin Asn Pro Pro Met Glu Thr Leu Tyr 
570 575 580 

i 

att ate gac ttt ttc att gca ttg gca att tgc aac aca gta gtg gtt. 2069 

He He Asp Phe Phe He Ala Leu Ala He Cys Asn Thr Val Val Val 
585 590 . 595 

tct get cct aac caa ccc cga caa aag ate aga cac cct tea ctg ggg 2117 
Ser Ala Pro Asn Gin Pro Arg Gin Lys He Arg His Pro Ser Leu Gly 
600 605 610 615 
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ggg ttg ccc att aag tct ttg gaa gag att aaa agt ctt ttc cag aga 2165. 
"Gly Leu Pro lie Lys ^e'r Leu ( (31u Glu I|e Lys Ser Leu Phe Gin Arg . 

620 625 ' 630 . 

tgg tct gtc cga aga tea agt tct cca teg ctt aac agt ggg aaa gag 2213 
Trp Ser Val • Arg Arg Ser Ser Ser Pro Ser Leu Asn Ser Gly Lys Glu . 
635 640 ' ' 645 

cca tct tct gga gtt cca aac gee ttt gtg age aga etc cct etc ttt 2261 
Pro Ser Ser Gly Val Pro Asn Ala Phe Val Ser Arg ,Leu Pro Leu Phe 
650 , 655 660 

agt cga atg aaa cca get tea cct gtg gag gaa gag gtc tec cag gtg 2309 
Ser Arg Met Lys 'Pro Ala Ser Pro Val Glu Glu Glu Val Ser Gin Val 
665 , , 670 675 

tgt gag age ccc cag tgc tec agt age tea get tgc tgc aca gaa aca 2357 
Cys Glu Ser Pro.. Gin Cys Ser Ser Ser Ser Ala Cys Cys Thr Glu Thr 
680 685 690 695 

gag aaa caa cac ggt gat gca ggc etc ctg aat ggc aag gca gag. tec 2405 
Glu Lys Gin His Gly Asp Ala Gly Leu Leu Asn Gly Lys Ala Glu Ser 

700 705 710 

i * 

i 

etc cct gga cag cca ttg gee tgc aac ctg] tgt tat ' gag gee gag age! 2453 
Leu Pro Gly Gl'n Pro. 'Leu Ala Cys Asn Leu Cys Tyr Glu Ala Glu Ser 
715 720 725 

cca gac gaa gcg gee tta gtg tat gee gee agg get tac caa tgc act 2501 
Pro Asp Glu Ala Ala Leu Val Tyr Ala Ala Arg Ala Tyr Gin Cys Thr 
730 735 740 

tta egg tct egg aca cca gag cag gtc atg gtg gacttt get get ttg 2549 
Leu Arg Ser Arg Thr Pro Glu Gin Val Met Val Asp Phe Ala Ala Leu 
745 750 755 

gga cca tta aca ttt caa etc eta cac ate ctg ccc ttt gac tea gta 2597 
Gly Pro Leu Thr Phe Gin Leu Leu His lie Leu Pro, Phe Asp Ser Val 
760 765 770 * 775 

aga aaa aga atg tct gtt gtg gtc cga cac cct ctt tec aat caa gtt 2 64 5 
Arg Lys Arg Met Ser Val Val Val Arg His Pro Leu Ser Asn Gin Val 
780 785 790 

gtg gtg tat acg aaa ggc get gat tct gtg ate atg gag tta ctg teg 2693 
Val Val Tyr Thr Lys Gly Ala Asp Ser Val lie Met Glu Leu Leu Ser 
795 800 805 

gtg get tec cca gat gga gca agt ctg gag aaa caa cag atg ata gta 2741 
Val Ala Ser Pro Asp Gly Ala Ser Leu Glu Lys Gin Gin Met lie Val 
810 815 820 

agg gag aaa ace cag aag cac ttg gat gac tat gee aaa caa ggc ctt 278 9 
Arg Glu Lys Thr Gin Lys His Leu Asp Asp Tyr Ala Lys Gin Gly Leu 
825 830 835 

cgt act tta tgt ata gca aag aag gtc atg agt gac act gaa tat gca 2837 
Arg Thr Leu Cys lie Ala Lys Lys Val Met Ser Asp Thr Glu Tyr Ala 
840 845 850 855 
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gag tgg ctg agg aat cat ttt tta get gaa acc age att gacaac agg 
Glu Trp Leu Arg 'Asn His Phe Leu Ala Glu Thr' Ser lie Asp As n Arg 
860 865 870 

gaa gaa, tta eta ctt gaa tct gee atg agg ttg gag aac aaa ctt aca 
Glu Glu Leu Leu Leu Glu Ser Ala Met Arg Leu Glu Asn Lys Leu Thr 
875 880 , 885 



2885 



2933 



tta ctt ggt get act ggc att gaa gac cgt.ctg cag gag gga gtc cct 
Leu Leu Gly Ala Thr Gly He Glu Asp Arg, Leu Gin Glu Gly Val Pro 
890 , 895 900 



2981 



gaa tct ata gaa get ctt cac aaa gcg ggc ate aag ate tgg atg ctg 

Glu Ser lie Glu Ala Leu His Lys Ala Gly lie Lys lie Trp Met Leu 
905 910 915 

'aca ggg gac aag cag gag aca get gtc aac ata get tat gca tgc aaa 

Thr Gly Asp Lys Gin Glu Thr Ala Val Asn He Ala Tyr Ala Cys Lys 

920 " 925 930 935 



3029 



3077 



eta ctg gag cca gat gac aag ctt ttt ate etc aat acc caa agt aaa 
Leu Leu Glu Pro Asp Asp Lys Leu Phe lie Leu Asn Thr Gin Ser Lys 
940 945 950 



3125 



gat gee tgt ggg atg ctg atg. age aca att ttg aaa gaa ctt cag aag 
Asp Ala Cys Gly Met Leu Met Ser Thr He Leu. Lys Glu Leu Gin Lys 
955 960 »■ 965 



3173 



aaa act caa gee ctg cca gag caa gtg tea tta agt gaa gat tta ctt 
Lys Thr Gin Ala Leu Pro Glu Gin Val Ser Leu Ser Glu Asp Leu Leu 
970 975 980 



3221 



cag cct cct gtc ccc egg gac tea ggg tta cga get gga etc att ate 
Gin Pro Pro Val Pro Arg Asp Ser Gly Leu Arg Ala Gly Leu He He 
985 990: 995 



3269 



act ggg aag acc ctg gag ttt gee ctg caa gaa agt ctg caa aag cag 3317 

Thr Gly Lys Thr Leu Glu Phe Ala Leu Gin Glu Ser Leu Gin Lys Gin 
1000 1005 1010 1015 

ttc ctg gaa ctg aca tct tgg tgt caa get gtg gtc tgc tgc cga gee 3365 

.Phe Leu Glu Leu Thr Ser Trp Cys Gin Ala Val Val Cys Cys Arg Ala 
1020 1025 1030 



aca ccg ctg cag aaa agt gaa gtg gtg aaa ttg gtc cgc age cat etc 
Thr Pro Leu Gin Lys Ser Glu Val Val Lys Leu Val Arg Ser His Leu 
1035 1040 1045 



3413 



cag gtg atg acc. ctt get att ggt gat ggt gee aat gat gtt age atg 
Gin Val Met Thr Leu Ala He Gly Asp Gly Ala Asn Asp Val Ser Met 
1050 1055 1060 



3461 



ata caa gtg gca gac att ggg ata ggg gtc tea ggt caa gaa ggc atg 
He. Gin Val Ala Asp He Gly He Gly Val Ser Gly Gin Glu Gly Met 
1065 1070 1075 



3509 



cag get gtg atg gee agt gac ttt gee gtt tct cag ttc aaa cat etc 
Gin Ala Val Met Ala Ser Asp Phe Ala Val Ser Gin Phe Lys His Leu 
1080 1085 1090 1095 



3557 
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age aag etc ctt ctt gtc cat gga cac tgg tgt tat aca egg ctt tec 3605 
Ser Lys Leu Leu Leu »Val His 'Gly His TrpCys Tyr Thr Ar$ Leu Ser ■ 
1100 1105 1110. 

aac atg att etc tat ttt ttc tat aag aat gtg gee tat gtg aac etc 3653 
Asn Met lie' Leu Tyr Phe Phe Tyr Lys Asn Val Ala Tyr Val Asn Leu 
1115 1120 ' 1125 

ctt ttc tgg tac cag ttc ttt tgt gga ttt tea gga aca tec atg act 3701 

Leu Phe Trp Tyr Gin Phe Phe Cys .Gly Phe Ser Gly 'Thr Ser Met Thr , 

1130 . 1135 . 1140 

i ' ' • 

gat tac tgg gtt ttg ate ttc ttc aac etc etc ttc aca tct gee cct 37 4 9 

Asp Tyr" Trp Val Leu lie Phe Phe Asn Leu Leu Phe Thr Ser Ala Pro 
1145 i ■ 1150 . 1155 

i ■ . 

cct gtc att tat ggt gtt ttg gag aaa gat gtg tct gca gag ace etc 3797 
Pro Val lie Tyr Gly Val Leu Glu Lys Asp Val Ser Ala Glu Thr Leu 
1160 * 1165 1170 1175 

atg eaa ctg cct gaa ctt tac aga agt ggt cag aaa tea gaij gca tac 3845 
Met Gin Leu Pro Glu Leu Tyr Arg Ser Gly Gin Lys Ser Glu Ala Tyr 

1180 , . : 1185 1190 

♦ 

i * 

tta ccc cat ace ttc tgg ate acc tta ttg gat get ttt tat eaa age 3893 
Leu Pro His Thr Phe Trp lie Thr Leu Leu Asp Ala Phe Tyr Gin Ser 
1195 1200 1205 

ctg gtc tgc ttc ttt gtg cct tat ttt ace tac. cag ggc tea gat act 3941 
Leu Val Cys Phe Phe Val Pro Tyr Phe Thr Tyr Gin Gly Ser Asp Thr 
1210 J.215 , 1220 

gac ate ttt gca ttt gga aac ccc ctg aac aca gee get ctg ttc ate 3989 
Asp lie Phe Ala Phe Gly Asn Pro Leu Asn Thr Ala Ala Leu Phe lie 
1225 1230 1235 

gtt etc etc cat ctg gtc att gaa age aag agt ttg act tgg att cac 4037 
Val Leu Leu His Leu Val He Glu Ser Lys Ser Leu » Thr Trp lie His 
1240 1245 1250 1255 

ttg ctg gtc ate att ggt age ate ttg tct tat ttt tta ttt gee ata 4085 
Leu Leu Val lie lie Gly Ser lie Leu Ser Tyr Phe Leu Phe Ala lie 
1260 1265 * 1270 

gtt ttt gga gee atg tgt gta act tgc aac cca cca tec aac cct tac 4133 
Val Phe Gly Ala Met Cys Val Thr Cys Asn Pro Pro Ser Asn Pro Tyr 
1275 1280 1285 

tgg att atg cag gag cac atg ctg gat cca gta ttc tac tta gtt tgt 4181 
Trp lie Met Gin Glu His Met Leu Asp Pro Val Phe Tyr Leu Val Cys 
1290 1295 1300 

ate etc acg acg tec att get ctt ctg ccc agg ttt gta tac aga gtt 4229 
lie Leu Thr Thr Ser lie Ala Leu Leu Pro Arg Phe Val Tyr Arg Val 
1305 1310 1315 

ctt cag gga tec ctg ttt cca tct cca att ctg aga get aag cac ttt 4277 
Leu Gin Gly Ser Leu Phe Pro Ser Pro lie Leu Arg Ala Lys His Phe 
1320 1325 1330 1335 
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gac aga cta : act cca gag gag agg act aaa get etc aag aag tgg aga 4325 

Asp Arg Leu * Thr" Pro ,Glu Glu » Arg Thr Lys Ala Leu Lys Lys Trp Arg ■ 
- . 1340 . 1345 - 1350. 

■ • , f 

ggg get gga aag atg aat caa gtg aca ,t,ca aag tat get aac caa tea 4 373 

Gly Ala Gly'Lys Met Asn Gin Val Thr Ser Lys Tyr Ala Asn Gin Ser 

1355 1360 1365 



get ggc aag tea 
Ala Gly Lys Ser 
1370 



gga aga aga ccc atg cct 
Gly Arg Arg Pro Met Pro 
1375 



ggc cct tct get gta ttt 
Gly Pro i Ser Ala Val Phe 
1380 



4421 



gca atg aag tea 
Ala Met Lys Ser 
1385 i 

ctg tgt gaa act, 
Leu Cys Glu Thr 
1400 



gca act tee tgt get att 
'Ala. Thr Ser Cys Ala lie 
1390 
» . * 
, get tta gat caa ggc tac 
Ala Leu Asp Gin Gly Tyr 
1405 



gag caa gga aac tta tct 
Glu Gin Gly Asn Leu Ser 
1395 

tct gaa act aag gee ttt 
Ser Glu Thr Lys Ala Phe 
1410 1415 



4469 



4517 



gag atg get gga. ccc tec aaa ggt aaa gaa age tag ataccctcct 
Glu Met Ala Gly Pro Ser Lys Gly Lys Glu Ser * 
1420 , , . 1425 



4563 



tggagttgca 
caagaatgac 
catgatgagc 
tagagagagt 
ttggattttg 
tttgagatat 
aggctccaat 
gaggaaatac 
ttaaacaaca 
tattaatagc 
gcatacctct 
gagggattct 
gagaagacct 
ataggtcagg 
actgacagta 
ttgaaggttc 
atgagatata 
aaaaatttgg 
gcgaatcatg 
gctactataa 
tatttgaaag 
atgcaatgtc 
cttctgccta 
tcttacataa 
ctgetttett 
gggcggccgc 



agt;attcttt 
ttgtttttcc 
attattgtat 
ttaaggaagt 
taaaagcatt 
atttatttta 
eggaattttt 
ggaaaaaaca 
aggtacctaa 
tttttatttc 
cactcatttc 
agggtttcat 
tttatgagat 
cagacatcag 
gectgatcaa 
atagttttta 
aacagaaact 
ttttaaaatc 
tatgetgget 
aatcatatat 
ttgtcagcca 
tacatcaagg 
ctattttggt 
gctgatttcg 
atgactgttt 



caaggttgga 
ataagggaca 
gtttgtatat 
gaaatatt ta 
ttcattctct 
ctaggagatc 
ttaagaaaaa 
ttgtagatgg 
aatagggtat 
ctatgggaag 
aatgttttcc 
cagggaccag 
ctgcctctgt 
ctcagcctgt 
agtgatacaa 
caataccctg 
aatacaagtt 
atttgaggac 
ttagttgtaa 
tcctgttgaa 
ccagtcatcc 
tcttcttaat 
ctgacatttt 
agaactttca 
ttataaataa 



agagggattt 
tgagcatttt 
acatttgtga 
attcagaacc 
tagaaattca 
t.tatattcta 
agtagttttt 
ttaatttaca 
aattatactg 
atgcttttgg 
tgaggtggag 
gaatgeatte 
atagatgttg 
ggcccattgg 
tcaatttcaa 
agacttttca 
gttctctgga 
aggaatgtct 
caaacgattt 
gttcttttga. 
agaatttect 
gactattatt 
tgtagectte 
aaatctcaca 
. actgtttcat 



tgaagaggta 
actaggcttg 
tagagggcta 
aaatgctttt 
agtattttca 
gggaaatget 
aatacattgg 
gataaaatcc 
cttaaaatac 
tcttctggct 
ccttcattgg 
ctctgtcagg 
tcagttagga 
gtgatttcct 
aacaatcttc 
ggtgttggag 
ggtttctatg 
atagcaagtt 
tattctaagt 
agtcctatct 
tcctgaatct 
ctcagggttt 
tgttattatt 
tagctaatgg 
aaataaaaaa 



tct ctccaag 
gaagagctga 
gagtttgacc 
gtaaaacttt 
aggggagtca 
ttaaatggtc 
ttaggactca 
caagagcett 
agttagtgcc 
gaagatgtag 
aaaggggaaa 
ttccaatcaa 
aactgaagee 
gtattttaaa 
cagagaccac 
cctctaaaat 
aggttcttag 
tactcctatt 
aaggccaggt 
ctatttatta 
ccatgctcat 
agttttctac 
ggaaatagtc 
aagttgcttt 
aaaaaaaaaa 



<210> 17 
<211> 1426 
<212> PRT 

<213> Homo sapiens 
<400> 17 

Met Thr Glu Ala Leu Gin Trp Ala Arg Tyr His Trp Arg Arg Leu He 

1 5 10 15 

Arg Gly Ala Thr Arg Asp Asp Asp Ser Gly Pro Tyr Asn Tyr Ser Ser 
20 25 30 
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4 623 
4683 
4 7.4 3 
4803 
4863 
4923 
4983 
5043 
5103 
5163 
5223 
5283 
5343 
5403 
5463 
5523 
5583 
5643 
5703 
5763 
5823 
5883 
5943 
6003 
6063 
6073 
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Leu . Leu Ala Cys Gly Arg Lys Ser Ser Gin lie Pro Lys Leu Ser Gly 

35 40 • ■ ■ 45 

Arg His Arg lie Val Val Pro His lie, Gin Pro Phe Lys Asp Glu Tyr 

Glu Lys Phe Ser Gly Ala Tyr Val Asn Asn Arg lie Arg Thr Thr Lys 
65 70 75 ( 80 

Tyr Thr Leu Leu Asn Phe Val Pro Arg Asn Leu Phe Glu Gin Phe His 

85 90 . . 95 

Arg Ala Ala Ser . Leu Tyr Phe Leu Phe Leu Val Val Leu Asn Trp Val 

100 105, 110 

Pro Leu Val Glu Ala Phe Gin Lys Glu lie Thr Met Leu Pro Leu Val 

115 ■ ' 120 , 125. 

Val Val Leu Thr lie lie Ala He Lys Asp Gly Leu Glu Asp Tyr Arg 

130 135 140 

Lys Tyr Lys He Asp Lys Gliji He Asn Asn Leu lie Thr Ly?> Val Tyr 
145 150 155 . 160 

Ser Arg Lys Glu Lys Lys Tyr He Asp Arg Cys Trp Lys Asp Val Thr 

165 ' 170 175 

Val Gly Asp Phe "lie Arg Leu Ser Cys Asn Glu Val He Pro Ala Asp 

180 185 190 

'Met Val Leu Leu Phe Ser Thr Asp Pro Asp Gly lie Cys His He Glu 

195 200 205 

Thr Ser Gly Leu Asp Gly Glu Ser Asn Leu Lys Gin Arg Gin Val Val 

210 215 • 220 

Arg Gly Tyr Ala Glu Gin Asp Ser Glu Val Asp Pro Glu Lys Phe Ser 
225 230 235 240 

Ser Arg He Glu Cys Glu Ser Pro Asn Asn Asp Leu Ser Arg Phe Arg 

245 250 255 

Gly Phe Leu Glu His Ser Asn Lys Glu Arg Val Gly Leu Ser Lys Glu 

260 265 270 

Asn Leu Leu Leu Arg Gly Cys Thr lie Arg Asn Thr Glu Ala Val Val 

275 " ~ 280 285 

Gly lie Val Val Tyr Ala Gly His Glu Thr Lys Ala Met Leu Asn Asn 

290 295 300 

Ser Gly Pro Arg Tyr Lys Arg Ser Lys Leu Glu Arg Arg Ala Asn Thr 
305 310 315 320 

Asp Val Leu Trp Cys Val Met Leu Leu Val He Met Cys Leu Thr Gly 

325 330 335 

Ala Val Gly His Gly lie Trp Leu Ser Arg Tyr Glu Lys Met His Phe 

340 345 350 

Phe Asn Val Pro Glu Pro Asp Gly His He He Ser Pro Leu Leu Ala 

355 360 365 

Gly Phe Tyr Met Phe Trp Thr Met He He Leu Leu Gin Val Leu He 

370 375 .380 

Pro He Ser Leu Tyr Val Ser He Glu He. Val Lys Leu Gly Gin He 
385 390 395 400 

Tyr Phe He Gin Ser Asp Val Asp Phe Tyr Asn Glu Lys Met Asp Ser 

405 410 415 

He Val Gin Cys Arg Ala Leu Asn He Ala Glu Asp Leu Gly Gin He 

420 425 430 

Gin Tyr Leu Phe Ser Asp Lys Thr Gly Thr Leu Thr Glu Asn Lys Met 

435 440 445 

Val Phe Arg Arg Cys Ser Val Ala Gly Phe Asp Tyr Cys His Glu Glu 

450 455 4 60 

Asn Ala Arg Arg Leu Glu Ser Tyr Gin Glu Ala Val Ser Glu Asp Glu 
465 470 475 480 

Asp Phe He Asp Thr Val Ser Gly Ser Leu Ser Asn Met Ala Lys Pro 

485 490 495 

Arg Ala Pro Ser Cys Arg Thr Val His Asn Gly Pro Leu Gly Asn Lys 
500 505 510 
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Pro Ser Asn His Leu Ala Gly Ser Ser Phe Thr Leu Gly Ser Gly Glu 

515 520 • 525 

Gly Ala Ser Glu Val Pro His Ser Arg Gin Ala Ala Phe Ser Ser Pro 





530 










535 




♦ 








• 


• 




He 


Glu 


Thr 


Asp 


Val 


Val 


Pro 


Asp 


Thr Arg Leu 


Leu 


Asp Lys 


Phe 


Ser 


545 










550 










555 




Thr lie 




560 


Gin 


lie 


Thr 


Pro 


Arg 


Leu 


Phe 


Met 


Pro Leu Asp Glu 


Gin 


Asn 










565 










570 








575 




Pro ' 


Pro 


Met 


Glu 


Thr 


Leu 


Tyr 


He 


He Asp 


Phe 


Phe 


He Ala 


Leu 


Ala 








580 










58^ 








590 






He 


Cys 


Asn 


Thr 


Val . 


Val 


Val 


Ser 


Ala 


Pro Asn Glri 


Pro Arg 


Gin 


Lys 






595 










600 






i 




605 




He. 


Arg 


His 


Pro 


Ser 


Leu 


Gly 


Gly Leu 


Pro 


lie 


Lys 


Ser Leu 


Glu 


Glu. 




610 










615 










620 








He 


Lys 


Ser 


Leu 


Phe 


Gin 


Arg 


Trp Ser Val Arg Arg 


Ser Ser 


Ser 


Pro 


625 










630 










635 


* 






640 


Ber 


Leu 


Asn 


Ser 


Gly 
645 


Lys 


Glu 


Pro 


Ser 


Ser 
650 


Gly" 


Val 


Pro Asn 


Ala 
655 


Phe 


Val 


Ser 


Arg 


Leu 
660 


Pro 


Leu 


Phe 


Ser 


Arg 
665 


Met 


Lys 


Pro 


Ala Ser 
670 


Pro 


Val 


• Glu 


Glu 


Glu 
675 


Val 


Ser 


Gin 


Val 


Cys 
680 


Glu 


Ser 


Pro 


Gin 


Cys Ser 
685 


Ser 


Ser 


Ser 


Ala 
690 


Cys 


Cys 


Thr 


Glu 


Thr 
695 


Glu 


Lys 


Gin 


His 


Gly 
700 


Asp Ala 


Gly 


Leu 


Leu 


Asn 


Gly Lys 


Ala 


Glu 


Ser 


Leu 


Pro 


Gly Gin 


Pro 


Leu Ala 


Cys 


Asn 


705 










710 










715 








720 


Leu 


Cys 


Tyr 


Glu 


Ala 
725 


Glu 


Ser 


Pro 


Asp 


Glu 
730 


Ala 


Ala 


Leu Val 


Tyr 
735 


Ala 


Ala 


Arg 


Ala 


Tyr 
740 


Gin 


Cys 


Thr 


Leu 


Arg 
745 


Ser 


Arg 


Thr 


Pro Glu 
750 


Gin 


Val 


Met 


Val 


Asp 
755 


Phe 


Ala 


Ala 


Leu 


Gly 
760 


Pro 


Leu 


Thr 


Phe 


Gin Leu 
7 65 


Leu 


His 


He 


Leu 
770 


Pro 


Phe 


Asp 


Ser 


Val 
775 


Arg 


Lys 


Arg 


Met 


Ser 
780 


Val Val 


Val 


Arg 


His 


Pro 


Leu 


Ser 


Asn 


Gin 


Val 


Val 


Val 


Tyr 


Thr 


Lys 


Gly Ala 


Asp 


Ser 


785 










790 










795 








800 


Val 


lie 


Met 


Glu 


Leu 


Leu 


Ser 


Val 


Ala 


Ser 


Pro 


Asp 


Gly Ala 


Ser 


Leu 


Glu 








805 










810 








815 




Lys 


Gin 


Gin 
820 


Met 


lie 


Val 


Arg 


Glu 
825 


Lys 


Thr 


Gin 


Lys His 
830 


Leu 


Asp 


. Asp 


Tyr 


Ala 


Lys 


Gin 


Gly Leu 


Arg 


Thr 


Leu 


Cys 


lie 


Ala Lys 


Lys 


Val 






835 










840 










845 






Met 


Ser 


Asp 


Thr 


Glu 


Tyr 


Ala 


Glu 


Trp 


Leu Arg 


Asn 


His Phe 


Leu 


Ala 




850 










855 










860 








Glu 


Thr 


'Ser 


He 


Asp 


Asn 


Arg 


Glu 


Glu 


Leu 


Leu 


Leu 


Glu Ser 


Ala 


Met 


8 65 










870 










875 








880 


Arg 


Leu 


Glu 


Asn 


Lys 


Leu 


Thr 


Leu 


Leu 


Gly Ala 


Thr 


Gly He 


Glu 


Asp 










885 










890 








895 




Arg 


Leu 


Gin 


Glu 


Gly 


Val 


Pro 


Glu 


Ser 


He 


Glu 


Ala 


Leu His 


Lys 


Ala 






900 










905 








910 






Gly 


He 


Lys 
915 


He 


Trp 


Met 


Leu 


Thr 
920 


Gly 


Asp 


Lys 


Gin 


Glu Thr 
925 


Ala 


Val 


Asn 


He 
930 


Ala 


Tyr 


Ala 


Cys 


Lys 
935 


Leu 


Leu 


Glu 


Pro 


Asp 
940 


Asp Lys 


Leu 


Phe 


lie 


Leu 


Asn 


Thr 


Gin 


Ser 


Lys 


Asp Ala 


Cys 


Gly Met 


Leu Met 


Ser 


Thr 


945 










950 










955 








960 


He 


Leu 


Lys 


Glu 


Leu 


Gin 


Lys 


Lys 


Thr 


Gin 


£la 


Leu 


Pro Glu 


Gin 


Val 










965 










970 






975 




Ser 


Leu 


Ser 


Glu 
980 


Asp 


Leu 


Leu 


Gin 


Pro 
985 


Pro 


Val 


Pro 


Arg Asp 
990. 


Ser 


Gly 
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Leu Arg Ala Gly Leu He He Thr Gly Lys Thr Leu Glu Phe Ala Leu 

995 -::.v . . ..M. „1000 . • 1005 » . 

Gin Glu Ser Leu Gin Lys Gin Phe Leu Glu Leu Thr Se r Trp Cys Gin 
~~ 1010 ~- ~^ TOp " 1020 § P 

Ala Val Val Cys Cys Arg Ala Thr Pro ^eu Gin Lys Ser Glu Val Val 
1025 ' 1030 1035 . ; ' 1040 

Lys Leu Val Arg Ser His Leu Gin Val Met Thr Leu Ala He Gly Asp 

1045 1050 1055 

Gly Ala Asn Asp Val Ser Met He Gin Val Ala Asp He Gly He Gly 

1060 1065 > 1070 

Val Ser Gly Gin Glu Gly, Met Gin Ala Val Met, Ala Ser Asp Phe Ala 

1075 i 1080 ' 1 . 1085 

Val Ser Gin Phe Lys His Leu Ser Lys Leu Leu Leu Val His Gly His 

1090 . 1095 . 1100 

Trp Cys Tyn Thr Arg Leu Ser Asn Met lie Leu Tyr Phe Phe Tyr Lys 
1105 ^ .1110 "■ 1115 1120 

Asn Val Ala Tyr Val Asn Leu Leu Phe Trp Tyr Gin Phe Phe Cys Gly 

, .1125 . 1130 1135 

Phe Ser Gly Thr Ser Met Thr Asp Tyr Trp Val Leu He Phe Phe Asn. 

. 1140 '• 1145 1150 . 

Leu Leu Phe Thr Ser Ala Pro Pro Val He Tyr Gly Val Leu Glu Lys 

1155 1160 ' 1165 

Asp Val Ser Ala Glu Thr Leu, Met Gin Leu Pro Glu Leu Tyr Arg Ser 

1170 1175 1180 , . 

Gly Gin Lys Sejr Glu Ala Tyr Leu Pro His Thr Phe' Trp He Thr Leu 
1185 ' . '1190 1195 , 1200 

Leu Asp Ala Phe Tyr Gin Ser Leu Val Cys Phe Phe Val Pro Tyr Phe 

1205 1210 1215 

Thr Tyr Gin Gly Ser Asp Thr Asp He Phe Ala Phe Gly Asn Pro Leu 

1220 12.25 1230 

Asn Thr Ala Ala Leu Phe lie Val Leu Leu His Leu Val lie Glu Ser 

1235 1240 ■ ,1245 

Lys Ser Leu Thr Trp He His Leu Leu Val He He Gly Ser lie Leu 

1250 1255 1260 

Ser Tyr Phe Leu Phe Ala He Val Phe Gly Ala Met Cys Val Thr Cys 
1265 1270 1275 1280 

Asn Pro Pro Ser Asn Pro Tyr Trp lie Met Gin Glu His Met Leu Asp 

1285 1290 , 1295 

Pro Val Phe Tyr Leu Val Cys He Leu Thr Thr Ser lie Ala Leu Leu 

1300 1305 1310 

Pro Arg Phe Val Tyr Arg Val Leu Gin Gly Ser Leu Phe Pro Ser Pro 

1315 1320 1325 

He Leu Arg Ala Lys His Phe Asp Arg Leu Thr Pro Glu Glu Arg Thr 

1330 " 1335 . - 1340 

Lys Ala Leu Lys Lys Trp Arg Gly Ala Gly Lys Met Asn Gin Val Thr 
1345 * 1350 1355 1360 

Ser Lys Tyr Ala Asn Gin Ser Ala Gly Lys Ser Gly Arg Arg Pro Met 

1365 1370 1375 

Pro Gly Pro Ser Ala Val Phe Ala Met Lys Ser Ala Thr Ser Cys Ala 

1380 1385 1390 

He Glu Gin Gly Asn Leu Ser Leu Cys Glu Thr Ala Leu Asp Gin Gly 

1395 1400 . 1405 

Tyr Ser Glu Thr Lys Ala Phe Glu Met Aia Gly Pro Ser Lys Gly Lys 

1410 1415 1420 

Glu Ser • 
1425 



<210> 18 
<211> 4281 
<212> DNA 
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<213> Homo -sapiens 

<220> • 
<221> CUS~ . , ■ . , , 

<222> (1) . . . 14281) ; , 

<400> 18 

atg act gag get etc caa tgg gec aga tat cac tgg cga egg ctg ate 

Met Thr Glu Ala Leu Gin Trp Ala Arg Tyr His Trp Arg Arg Leu He 

1 5 10 ,15 



agg cac egg att gtt gtt ccc cac ate cag ccc ttc aag gat gag tat 
Arg His Arg' lie Val Val Pro His He Gin Pro Phe Lys hsp Glu Tyr 
50 55 ■ ' 60 

gag aag ttc tec gga gee tat' gtg 'aac aat cga ata cga aca. aca aag 
Glu Lys Phe Ser Gly Ala Tyr Val Asn As n Arg He Arg Thr Thr Lys 1 
65 '70 75 , 80 

tac aca ctt ctg aat ttt gtg cca aga aat tta ttt gaa caa ttt cac 
Tyr Thr Leu Leu Asn Phe Val Pro Arg Asn Leu Phe Glu Gin Phe His 
85 90 95 



agt agg aaa gag aaa aaa tac att gac cga tgc tgg aaa gac gtt act 
Ser Arg Lys Glu Lys Lys Tyr He Asp Arg Cys Trp Lys Asp Val Thr 
165 170 175 



atg gta eta etc ttt tec act gat cca gat gga ate tgt cac att gag 
Met Val Leu Leu Phe Ser Thr Asp Pro Asp Gly He Cys His He Glu 
195 200 205 



48 



aga ggt gca ace agg gat gat} gat tea ggg cca tac aac tat tec teg 96 

Arg Gly Ala Thr Arg Asp Asp Asp Ser Gly Pro Tyr Asn Tyr Ser Ser 

20 ' 25 30 

.1 • • 

ttg etc gee tgt ggg ,cgc aag tec tct cag ate cct aaa ctg tea gga 14 4 

Leu Leu Ala Cys Gly Arg Lys Ser Ser Gin He Pro Lys Leu Ser Gly 
35 40 45 



192 



240 



288 



aga get gec agt tta tat ttc ctg ttc ct;a gtt gtc ctg aac tgg gta 336 

Arg Ala Ala Ser Leu Tyr Phe Leu Phe Leu Val Val Leu Asn Trp Val 
100 105 110 

cct ttg gta gaa gee ttc caa aag gaa ate ace atg . ttg cct ctg gtg 384 

Pro Leu Val Glu Ala Phe Gin Lys Glu He Thr Met Leu Pro Leu Val 
115 120 ' ,125 

gtg gtc ctt aca att ate gca att aaa gat ggc ctg gaa gat tat egg 

Val Val Leu Thr He He Ala He Lys Asp Gly Letu Glu Asp Tyr Arg 

130 135 140 

aaa tac aaa att gac aaa cag ate aat aat tta. ata act aaa gtt tat 480 

Lys Tyr Lys He Asp Lys Gin He Asn Asn Leu He Thr Lys Val Tyr 
145 150 155 160 



432 



528 



gtt ggg gac ttt att cgc etc tec tgc aat gag gtc ate cct gca gac 57 6 
Val Gly Asp Phe lie Arg Leu Ser Cys Asn Glu Val lie Pro Ala Asp 
180 185 190 



624 
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act tct ggt ctt gat gga gag age aat tta aaa cag agg cag gtg gtt 672 
Thr Ser Gly Leu Asp- Gly Glu Ser Asn Leu Lys Gin Arg Gin. Val -.Val 
210 - 215 ' 220 

egg gga tat gca gaa.cag. gac tct gaa gtt gat' cct gag a&g ttt tec 720 
Arg Gly Tyr Ala Glu Gin Asp Ser Glu Val Asp Pro Glu Lys Phe Ser 
225 230 235 • . 240 

agt agg ata gaa tgt gaa age cca aac aat gac etc age aga ttc cga 7 68 
Ser Arg lie Glu Cys Glu Ser Pro Asn Asn Asp Leu Ser Arg Phe Arg 

245 ■ 250- ; ■ 255 

■ • i 

ggc ttc eta gaa cat tec aac aaa gaa cgc gtg ggt etc agt aaa gaa 816 

Gly Phe Leu Glu His Ser Asn Lys Glu Arg Val Gly Leu Ser Lys Glu 

260 265 270 

1 ■ ; 

aat ttg ttg ctt aga gga tgc ace att aga aac • aca gag get gtt gtg 8 64 

Asn Leu Leu Leu Arg Gly Cys Thr lie Arg Asn Thr Glu Ala Val Val 
275 * 280 285 

ggc att gtg gtt tat gca ggc cat gaa acc aaa gca atg ctg aac aac 912 
Gly lie Val Val Tyr Ala Gly His Glu Thr Lys Ala Met Leu Asn Asn 
290 295 300 

agt ggg cca egg tat aag cgc age aaa tta gaa aga aga gca aac aca 960 
Ser Gly Pro Arg Tyr Lys Arg Ser Lys Leu Glu Arg Arg Ala Asn Thr 
305 310 315 320 

gat gtc etc tgg tgt gtc atg ctt ctg gtc ata atg tgc tta act ggc 1008 
Asp Val Leu Trp Cys Val Met Leu Leu Val He Met Cys Leu Thr Gly 
325 330 .335 

gca gta ggt cat gga ate tgg ctg age agg tat gaa aag atg cat ttt 1056 
Ala Val Gly His Gly He Trp Leu Ser Arg Tyr Glu Lys Met His Phe 
340 345 o 350 

ttc aat gtt ccc gag cct gat gga cat ate ata tea cca ctg ttg gca 1104 
Phe Asn Val Pro Glu Pro Asp Gly His He He Ser Pro Leu Leu Ala 
355 360 365 

gga ttt tat atg ttt tgg acc atg ate att ttg tta cag gtc ttg att 1152 
Gly Phe Tyr Met Phe Trp Thr Met He lie Leu Leu Gin Val Leu lie 
370 375 380 

cct att -tct etc tat gtt tec ate gaa att gtg aag ctt gga caa ata 1200 
Pro lie Ser Leu Tyr Val Ser He Glu lie Val Lys Leu Gly Gin He 
385 390 395 400 

tat ttc att caa agt gat gtg gat ttc tac aat gaa aaa atg gat tct 1248 
Tyr Phe He Gin Ser Asp Val Asp Phe Tyr Asn Glu Lys Met Asp Ser 
405 410 415 

att gtt cag tgc cga gee ctg aac ate gee gag gat ctg gga cag att 1296 
He Val Gin Cys Arg Ala Leu Asn He Ala Glu Asp Leu Gly Gin lie 
420 425 430 

cag tac etc ttt tec gat aag aca gga acc etc act gag aat aag atg 1344 
Gin Tyr Leu Phe Ser Asp Lys Thr Gly Thr Leu Thr Glu Asn Lys Met 
435 440 445 
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gtt ttt cga' aga tgt agt gtg gca gga ttt gat tac tgc cat gaa gaa 1392. 
Val Phe Arg Arg Cys per Val Ala Gly -Phe Asp Tyr Cys His Glu Gl,u . 
450 455 . . ' 460 

aat gcc agg agg ttg gag tec tat cag gaa get gtctct gaa gat gaa 1440 
Asn Ala Arg 'Arg Leu Glu Ser Tyr Gin Giu Ala Val Set Glu Asp Glu . 
465 470 .475 '] ' 480 

gat ttt ata gac aca gtc agt ggt tec etc age aat atg gca aaa ccg 1488 

Asp Phe lie Asp Thr Val Ser Gly Ser Leu Ser Asn frtot Ala Lys Pro 

485 490 495 

i ... . 

aga gcc ccc age tgc agg aca gtt cat aat ggg cct ttg gga aat aag 1536 

Arg Ala Pro Ser Cys Arg Thr Val His Asn Gly Pro Leu Gly Asn Lys 

,500 " " , 505 510 

ccc tea aat cat ctt get ggg age tct ttt act eta gga agt gga gaa 1584 
Pro Ser Asri His Leu Ala Gly Ser Ser Phe Thr Leu Gly Ser Gly Glu 
515 520 525 

gga gcc agt gaa gtg cct cat tec aga cag get get ttc agt age ccc 1632 

Gly Ala Ser Glu Val Pro His Ser Arg Gin Ala Ala Phe Ser Ser Pro 
530 535 ( , : 540 

i 

att gaa aca gac gtg gta cca gac ace agg' ctt tta 'gac aaa ttt agt' 1680 

lie Glu Thr Asp Val Val Pro Asp Thr Arg Leu Leu Asp Lys Phe Ser 
545 550 . 555 560 



cag att aca cct egg etc ttt atg cca eta gat gag acc ate caa aat 1728 

Gin lie Thr Pro Arg Leu. Phe Met Pro Leu Asp Glu Thr lie Gin Asn 

565 - 570 575 

cca cca atg gaa act ttg tac att ate gac ttt ttc att gca ttg gca 1776 

Pro Pro Met Glu Thr Leu Tyr lie lie Asp Phe Phe lie Ala Leu Ala 

580 585 590 



att tgc aac aca gta gtg gtt tct get cct aac caa ccc cga caa aag 1824 

lie Cys Asn Thr Val Val Val Ser Ala Pro Asn Gin ,Pro Arg Gin Lys 
595 . 600 605 

ate aga cac cct tca-ctg ggg ggg ttg ccc att aag tct ttg gaa gag 1872 

lie Arg His Pro Ser Leu Gly Gly Leu Pro He Lys Ser Leu Glu Glu 

610 615 620 



att aaa agt ctt ttc cag aga tgg tct gtc cga aga tea agt tct cca 1920 

He Lys Ser Leu Phe Gin Arg Trp Ser Val Arg Arg Ser Ser Ser Pro 
625 630 635 640 

teg ctt aac agt ggg aaa gag cca tct tct gga gtt cca aac gcc ttt 1968 

Ser Leu Asn Ser Gly Lys Glu Pro Ser Ser Gly Val Pro Asn Ala Phe 

645 - 650 655 

gtg age aga etc cct etc ttt agt cga atg aaa cca get tea cct gtg 2016 

Val Ser Arg Leu Pro Leu Phe Ser Arg Met Lys Pro Ala Ser Pro Val 
660 665 670 

gag gaa gag gtc tec cag gtg tgt gag age ccc cag tgc tec agt age 2064 

Glu Glu Glu Val Ser Gin Val Cys Glu Ser Pro Gin Cys Ser Ser Ser 
675 680 685 
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tca ^c€ : ^gc tgcaca gaa aca gag aaa caa • cac .ggt gat gca ggc . etc .2112 
Sex-- Ala -Cys Cys Thr Glu Thr Glu Lys Gin Hi s^Gly. Asp- Ala Gly Leu 
.690 :". 695 , -700 . .. 

ctg aat ggc aag gca gag tec etc cct gga.cag cca ttg gec tgc aac 2160 

Leu Asn Gly Lys Ala Glu Ser Leu Pro Gly Gin Pro Leu Ala.Cys Asn 

705 710 715 t : • 720 

ctg tgt tat gag gee gag age cca gac gaa gcg gee tta gtg tat gee 2208 

Leu Cys Tyr Glu Ala Glu Ser Pro Asp Glu Ala Ala Leu Val Tyr Ala 

725 730' : 735 

•» 

- i » 

gee agg get tac caa tgc act tta egg tct egg aca cca gag cag gtc .2256 

Ala Arg Ala Tyr Gin Cys Thr Leu Arg Ser Arg Thr Pro Glu Gin Val 

740 . 745 750 



'atg gtg gac ttt get get ttg gga cca tta aca ttt caa etc eta cac 
Met Val Asp Phe Ala Ala Leu Gly Pro Leu Thr Phe Gin Leu Leu His 
755 * 760 765 



2304 



ate ctg ccc ttt gac tea gta aga aaa aga atg tct gtt gtg gtc cga 2352 
lie Leu Pro Phe Asp Ser Val Arg Lys Arg Met Ser Val Val Val Arg 
770 775 780 

cac cct ctt tec aat caa gtt gtg gtg tat acg aaa ggc get gat tct 2400 
His Pro Leu Ser Asn Gin Val Val Val Tyr Thr Lys Gly Ala Asp Ser 
785 790 795 . 800 

gtg ate atg gag tta ctg teg gtg get tec cca gat gga gca agt ctg 2448 
Val lie Met Glu Leu Leu Ser Val Ala Ser Pro Asp Gly Ala Ser Leu 
805 810 815 

gag aaa caa cag atg ata gta agg gag aaa ace cag aag cac ttg gat 24 96 
Glu Lys Gin Gin Met lie Val Arg Glu Lys Thr Gin Lys His Leu Asp 
820 825 830 

gac tat gec aaa caa ggc ctt cgt act tta tgt ata gca aag aag gtc 2544 
Asp Tyr . Ala Lys Gin Gly Leu Arg Thr Leu Cys lie Ala Lys Lys Val 
835 840 845 

atg agt gac act gaa tat gca gag tgg ctg agg aat cat ttt tta get 2592 
Met Ser Asp Thr Glu Tyr Ala Glu Trp Leu Arg Asn His Phe Leu Ala 
850 855 860 

gaa acc "age att gac aac agg gaa gaa tta eta ctt gaa tct gee atg 2640 
Glu Thr Ser lie Asp Asn Arg Glu Glu Leu Leu Leu Glu Ser Ala Met 
865 870 875 880 

agg ttg gag aac aaa ctt aca tta ctt ggt get act ggc att gaa gac 2688 
Arg Leu Glu Asn Lys Leu Thr Leu Leu Gly Ala Thr Gly He Glu Asp 
885 890 895 

cgt ctg cag gag gga gtc cct gaa tct ata gaa get ctt cac aaa gcg 2736 
Arg Leu Gin Glu Gly Val Pro Glu Ser He Glu Ala Leu His Lys Ala 
900 905 910 

ggc ate aag ate tgg atg ctg aca ggg gac aag cag gag aca get gtc 2784 
Gly He Lys He Trp Met Leu Thr Gly Asp Lys Gin Glu Thr Ala Val 
915 920 925 
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aac at a get tat gca~t:gc.aaa eta ctg gag cca gat gac a'agctt ttt 
Asn lie Ala Tyr Ala Cys- Lys Leu Leu Glu Pro, Asp Asp Lys Leu Phe 
930 - 935 ^4 0 



2832 



ate etc aat ace caa agt aaa gat gee tgt ggg atg ctg atg age aca 
lie Leu* Asn Thr Gin Ser Lys Asp Ala Cys Gly- Met Leu Met Ser Thr 
945 950 955 ' 960 



2880 



att ttg aaa gaa ctt cag aag aaa act caa gee ctg cca gag caa gtg 
He Leu Lys Glu Leu Gin Lys Lys Thr Gin Ala Leu Pro Glu Gin Val 
" . 965 970' ; 975 



2928 



tea tta agt gaa gat tta ctt cag cct cct gtc ccc egg gac tea ggg 
Ser Leu Ser Glu Asp Leu Leu Gin Pro Pro Val Pro Arg Asp Ser Gly 
980 , 985 990 



2976 



tta cga get gga etc att ate act ggg aag ace : ctg gag ttt gee ctg 
Leu Arg Ala Gly Leu He lie ^hr Gly Lys Thr Leu Glu Phe Ala Leu 
995 * 1000 1005 



3024 
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cjat gtg tct. gca gag acc etc atg caa ctg 'c.ct./gaa' ctt tac aga agt 3552. 
Asp Val Ser Ala Glu Thr I^u ftlet Gin Leu ;PrQ_Glu Leu Tyx; Arg Ser , 
1170 M ■ 1175 ' . * .1180 

ggt cag aaa tea gag gca tac tta ccc cat acc ttc. tgg ate acc t£a 5600 
Gly Gin Lys »Ser Glu Ala Tyr Leu Pro His Thr Phe Trp lie Thr Leu 
1185 1190 1195 ■ ■ 1200 

ttg gat get ttt tat caa age ctg gtc tgc ttc ttt gtg cct tat ttt 3648 
Leu Asp Ala Phe Tyr Gin Ser Leu Val Cys Phe Phe Val Pro Tyr Phe 
1205 . 1210 1215 

' , ' ' ' 

acc tac cag ggc tea gat act, gac ate ttt gca ttt gga aac ccc ctg 3696 

Thr Tyr Gin Gly Ser Asp Thr Asp lie Phe Ala Phe Gly Asn Pro Leu 

} 1220 1225 1230 

aac aca gee get ctg ttc ate gtt etc etc cat ctg gtc att gaa age 3744 
Asn Thr Ala Ala Leu Phe lie Val Leu Leu His Leu Val lie Glu Ser 
1^35 ' 1240 . 1245 

aag agt ttg" act tgg att cac ttg ctg gtc ate att ggt ago ate ttg 3792 
Lys Ser Leu Thr Trp lie His Leu Leu Val He lie Gly Ser He Leu 
1250 1255 1260 

tct tat ttt tta ttt gee ata gtt ttt gga* gee atg tgt cjta act tgci 3840 
Ser Tyr Phe Leu Phe Ala He Val Phe Gly Ala Met Cys Val Thr Cys 
1265 1270 , 1275 • ' 1280 

aac cca cca tec aac cct tac tgg att atg cag gag cac atg ctg gat 3888 
Asn Pro Pro Ser Asn Pro Tyr Trp lie Met Gin Glu His Met Leu Asp 
1285 1290 1295 

cca gta ttc tac tta gtt tgt ate etc acg acg tec att . get ctt ctg 3936 
Pro Val Phe Tyr Leu Val Cys lie Leu Thr Thr Ser He Ala Leu Leu 
1300 1305 1310 

ccc agg ttt gta tac aga gtt ctt cag gga tec ctg ttt cca tct cca 3984 
Pro Arg Phe Val Tyr Arg Val Leu Gin Gly Ser Leu Phe Pro Ser Pro 
1315 1320 1325 

att ctg aga get aag cac ttt gac aga eta act cca gag gag agg act 4 032 
He Leu Arg Ala Lys His Phe Asp Arg Leu Thr Pro Glu Glu Arg Thr 
1330 1335 1340 

aaa get etc aag aag tgg aga ggg get gga aag atg aat caa gtg aca 4080 
Lys Ala Leu Lys Lys Trp Arg Gly Ala Gly Lys Met Asn Gin Val Thr 
1345 1350 1355 1360 

tea aag tat get aac caa tea get ggc aag tea gga aga aga ccc atg 4128 
Ser Lys Tyr Ala Asn Gin Ser Ala Gly Lys Ser Gly Arg Arg Pro Met 
1365 1370 1375 

cct ggc cct tct get gta ttt gca atg aag tea gca act tec tgt get 4176 
Pro Gly Pro Ser Ala Val Phe Ala Met Lys Ser Ala Thr Ser Cys Ala 
1380 1385 1390 

att gag caa- gga aac tta tct ctg tgt gaa act get tta gat caa ggc 4224 
He Glu Gin Gly Asn Leu Ser Leu Cys Glu Thr Ala Leu Asp Gin Gly 
1395 1400 1405 
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tac tct gaa act aag gcc ttt gag atg get gga ccc tec aaa ggt aaa 
Tyr Ser Glu Thr Lys ( $ia .Phe ,Glu Met- A).a Gly Pro Ser Lys Gly Lys 
1410 ' 1415 ' 1420 



4272 



gaa age tag 
Glu Ser * • 
1425 



4281 



<210> 19 
<211> 7221 
<212> DNA 

<213> Homo sapiens 



<220> , 
<221> CDS 
<222> (167) 



(3700) 



<400> 19 

gecgegggat gggaacgegg cgcggggagt gaggcagt.gg cggcggcggc ggtaagcgga 60 
acttcggccc gaggggctcg cccgctcccg cctctgtctt gtcggcctcc' acctgcagcc 120 
ccgcggcccc cgcgccccgc gggacccgga cggcgacgac ggggga atg tgg cgc 175 

" Met Trp Arg 

' .. . • - 1 . . , 

tgg ate egg cag cag-ctg ggt ttt gac cca cca cat cag agt gac aca 223 
Trp lie Arg Gin Gin Leu Gly Phe Asp Pro Pro His Gin Ser Asp Thr 
5 10 15 



aga ace ate tac gta gcc cac agg ttt cct cag aat ggc ctt tac aca 
Arg Thr He Tyr Val Ala His Arg Phe Pro Gin Asn Gly Leu Tyr Thr 
20 25 1 30 35 



271 



cct cag aaa ttt ata gat aac agg ate att tea tct aag tac act gtg 
Pro Gin Lys Phe He Asp Asn Arg He He Ser Ser Lys Tyr Thr Val 
40 45 50 



319 



tgg aat ttt gtt cca aaa aat tta ttt gaa cag ttc.aga aga gtg gca 
Trp Asn Phe Val Pro' Lys Asn Leu Phe Glu Gin Phe Arg Arg Val Ala 
55 '60 65 



367 



aac ttt tat ttt ctt att ata ttt ttg gtt cag ctt atg att gat aca 
Asn Phe Tyr Phe Leu lie lie Phe Leu Val Gin Leu Met lie Asp Thr 
70 75 80 



415 



cct ace agt cca gtt ace agt gga ctt cca tta ttc ttt gtg ata aca 
Pro Thr Ser Pro Val Thr Ser Gly Leu Pro Leu Phe Phe Val lie Thr 
85 90 * 95 



4 63 



gta act gcc ata aag cag gga tat gaa gat tgg tta egg cat aac tea 
Val Thr Ala He Lys Gin Gly Tyr Glu Asp Trp Leu Arg His Asn Ser 
100 105 110 115 



511 



gat aat gaa gta aat gga get cct gtt tat gtt gtt cga agt ggt ggc 
Asp Asn Glu Val Asn Gly Ala Pro Val Tyr Val Val Arg Ser Gly Gly 
120 125 130 



559 



ctt gta aaa act aga tea aaa aac att egg gtg ggt gat att gtt cga 
Leu Val Lys Thr Arg Ser Lys Asn He Arg Val Gly Asp He Val Arg 
135 140 145 



607 
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ata gcc aaagat gaa att ttt cct gca gac ttg gtg ctt ctg tec tea 655 
lie Ala Lys Asp Glu lie Phe Pro Ala Asp Leu- Val Leu Leu Ser Ser 
150 155 160 

gat cga, ctg gat ggt tec tgt cac gtt aca act got agt ttg gac gga 703 
Asp Arg Leu Asp Gly Ser Cys His Val Thr Thr Ala Ser Leu .Asp Gly 
165 170 1,75 

gaa act aac ctg aag aca cat gtg gca gtt cca gaa aca gca tta tta 751 
Glu Thr Asn Leu Lys Thr His Val Ala Val Pro Glu Thr Ala Leu Leu 
180 185. 190 * 195 

T 

■ * - I » ■ 

caa aca gtt gcc aat ttg gac act eta gta get gta ata gaa tgc cag .799 
Gin Thr Val Ala Asn Leu Asp Thr Leu Val Ala Val lie Glu Cys Gin 
200 , 205 .210 

baa cca gaa gca gac tta tac aga ttc atg gga cga atg ate ata acc 847 
Gin Pro Glu Ala Asp Leu Tyr Arg Phe Met Gly Arg Met lie lie Thr 
215 220 225 

caa caa atg gaa gaa att gta aga cct ctg ggg ccg gag agt etc ctg 8 95 
Gin Gin Met Glu Glu lie Val Arg Pro Leu Gly Pro Glu Ser Leu Leu 
230 235 240 

ctt cgt gga gcc aga tta aaa aac aca aaa gaa att ttt ggt gtt gcg 943 
Leu Arg Gly Ala Arg Leu Lys Asn Thr Lys Glu lie Phe Gly Val Ala 
245 250 255 • 

gta tac act gga atg gaa act aag atg gca tta aat tac aag age aaa 991 
Val Tyr Thr Gly Met Glu Thr Lys Met Ala Leu Asn Tyr Lys Ser Lys 
260 265 270 275 

tea cag aaa cga tct gca gta gaa aag tea atg aat aca ttt ttg ata 1039 
Ser Gin Lys Arg Ser Ala Val Glu Lys Ser Met Asn Thr Phe Leu lie 
280 285 290 

att tat eta gta att ctt ata tct gaa get gtc ate age act ate ttg 1087 
lie Tyr Leu Val lie Leu lie Ser Glu Ala Val He Ser Thr He Leu 
295 300 305 

aag tat aca. tgg.caa get gaa gaa aaa tgg gat gaa cct tgg tat aac 1135 
,Lys Tyr Thr Trp Gin Ala Glu Glu Lys Trp Asp Glu Pro Trp Tyr Asn 
310 . 315 320 

caa aaa aca gaa cat caa aga aat age agt aag att ctg aga ttt att 1183 
Gin Lys Thr Glu His Gin Arg Asn Ser Ser Lys He Leu Arg Phe He 
325 330 335 

tea gac ttc ctt get ttt ttg gtt etc tac aat ttc ate att cca att 1231 
Ser Asp Phe Leu Ala Phe Leu Val Leu Tyr Asn Phe He He Pro He 

340 " 345 350 355 

i 

tea tta tat gtg aca gtc gaa atg cag aaa ttt ctt gga tea ttt ttt 1279 
Ser Leu Tyr Val Thr Val Glu Met Gin Lys Phe Leu Gly Ser Phe Phe 
360 365 370 

att ggc tgg gat ctt gat ctg tat cat gaa gaa tea gat cag aaa get 1327 
He Gly Trp Asp Leu Asp Leu Tyr His Glu Glu Ser Asp Gin Lys Ala 
375 380 385 
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i ■ 

caa gtc aat act tec gat ctg aat gaa gag ctt gga cag gta. gag tac 1375 

Gin Val Asn Thr Ser Asp Leu Asn Glu Glu Leu Gly Gin Val Glu Tyr 

390 395. 400 

gtg ttt aca gat aaa act ggt aca ctg aca gaa aa.t gag atg cag ttt 1423 , 
Val Phe Thr Asp Lys Thr Gly Thr Leu Thr Glu Asn Glu Met Gin Phe ' 
405 410 415 ; 

egg gaa tgt tea att aat ggc atg aaa tac caa gaa att aat ggt. aga 1471 
Arg Glu Cys Ser lie Asn Gly Met Lys Tyr Gin . Glu lie Asn Gly Arg 
420 ' 425 ' ' 430 ; 435 " 

t ■ • 

ctt gta ccc gaa gga cca aca cca gac tct tea gaa gga aac tta tct 1519 
Leu Val Pro Glu Gly Pro Thr Pro Asp Ser Ser Glu Gly Ash Leu Ser 
440 , 445 450 

itat ctt agt agt tta tec cat ctt aac aac tta' tec cat ctt aca acc 1567 
Tyr Leu Ser Ser Leu Ser His 'Leu Asn Asn Leu Ser His Leu Thr Thr 
455 4 60 4 65 

agt tec tct ttc aga acc agt cct gaa aat gaa act gaa eta att aaa 1615 
Ser Ser Ser Phe Arg Thr Ser Pro Glu Asn Glu Thr Glu Leu lie Lys 
470 475 480 

gaa cat gat etc ttc ttt aaa gca gtc . agt etc tgt cac act gta cag 1663 
Glu His Asp Leu Phe Phe Lys Ala Val Ser Leu Cys His Thr Val Gin 
485 490 495 , 

att age aat gtt caa act gac tgc act ggt gat ggt ccc tgg caa tec 1711 
lie Ser Asn Val Gin Thr Asp Cys Thr Gly Asp Gly Pro Trp Gin Ser 
500 505 510 515 

aac ctg gca cca teg cag ttg gag tac tat gca tct tea cca gat gaa 1759 
Asn Leu Ala Pro Ser Gin Leu Glu Tyr Tyr Ala Ser Ser Pro Asp Glu 
520 525 530 

aag get eta gta gaa get get gca agg att ggt att gtg ttt att ggc 1807 
Lys Ala Leu Val Glu Ala Ala Ala Arg He Gly He Val Phe lie Gly 
535 540 545 

aat tct gaa gaa act atg gag gtt aaa act ctt gga aaa ctg gaa egg 1855 
Asn Ser Glu Glu Thr Met Glu Val Lys Thr Leu Gly Lys Leu Glu Arg 
550 555 560 

tac aaa' ctg ctt cat att ctg gaa ttt gat tea gat cgt agg aga atg 1903 
Tyr Lys Leu Leu His lie Leu Glu Phe Asp Ser Asp Arg Arg Arg Met 
565 570 575 

agt gta att* gtt cag gca cct tea ggt gag aag tta tta ttt get aaa 1951 
Ser Val lie Val Gin Ala Pro Ser Gly Glu Lys Leu Leu Phe Ala Lys 
580 585 590 595 

gga get gag tea tea att etc cct aaa tgt ata ggt gga gaa ata gaa 1999 
Gly Ala Glu Ser Ser He Leu Pro Lys Cys He Gly Gly Glu He Glu 
600 605 610 

aaa acc aga att cat gta gat gaa ttt get ttg aaa ggg eta aga act 2047 
Lys Thr Arg He His Val- Asp Glu Phe Ala Leu Lys Gly Leu Arg Thr 
615 620 625 
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ctg tgt ata gca tat aga aaa ttt aca tea aaa gag tat gag gaa ata 2095. 
■Leu Gys lie Ala Tyr ( Arg &ys lt Phe Thr Ser Lys Glu Tyr Glu Glu I^e . 
630 ' 635 640 

■ii | • • y . 

gat aaa cgc ata ttt gaa gec agg act ejee ttg cag cag egg gaa'gag 2143 
Asp Lys Argi lie Phe Glu Ala Arg Thr 'Ala Leu Gin Gin Arg Glu Glu . 

645 650 ~ ' ; 655 ".' 

aaa ttg gca get gtt ttc cag ttc ata gag aaa gac ctg ata tta ctt 2191 
Lys Leu Ala Ala Val Phe Gin Phe lie Glu Lys Asp, Leu lie Leu Leu 
660 665. 670, 675 ' 

gga gec aca gca gta gaa gac aga eta caa gat aaa gtt cga gaa act 2239 
Gly Ala Thr Ala »Val Glu Asp Arg Leu Gin Asp Lys Val Arg Glu Thr 
, 680 , 685 690 

t • 

att gaa gca ttg a£a atg get ggt ate aaa gta tgg gta ctt act ggg 2287 

He Glu A,la Leu Arg Met Ala Gly lie Lys Val Trp Val Leu Thr Gly 
695" 700 - 705 

gat aaa cat gaa aca get gtt agt gtg agt tta tea tgt ggt cat ttt 2335 
Asp Lys His Glu Thr Ala Val Ser Val Set Leu Ser Cys Gly His Phe 

710 715 720 

i • - . 

cat aga. acc atg aac ate ctt gaa ctt ata aac cag' aaa tea gac. agd 2383 
His Arg Thr Met Asn. 'lie Leu Glu Leu He Asn Gin Lys Ser Asp Ser 
725 730 , 735 

gag tgt get gaa caa ttg agg cag ctt gec aga aga att aca gag gat 2431 
Glu Cys Ala Glu Gin Leu Arg Gin Leu Ala Arg Arg tie Thr Glu Asp 
740 745 750 755 

cat gtg att cag cat ggg ctg gta gtg gat ggg ace age eta tct ctt 2479 
His Val He Glh His Gly Leu Val Val Asp Gly Thr Ser Leu Ser Leu 
760 765 770 

gca etc agg gag cat gaa aaa eta ttt atg gaa gtt tgc aga aat tgt 2527 
Ala Leu Arg Glu His Glu Lys Leu Phe Met Glu Val, Cys Arg Asn Cys 
775 780 ' 785 

tea get gta tta tgc tgt cgt atg get cca ctg cag aaa gca aaa gta 2575 
Ser Ala Val Leu Cys Cys Arg Met Ala Pro Leu Gin Lys Ala Lys Val 
790 795* 800 

ata aga eta ata aaa ata tea cct gag aaa cct ata aca ttg get gtt 2623 
He Arg Leu He Lys He Ser Pro Glu Lys Pro lie Thr Leu Ala Val 
805 810 815 

ggt gat ggt get aat gac gta age atg ata caa gaa gee cat gtt ggc 2671 
Gly Asp Gly Ala Asn Asp Val Ser Met lie Gin Glu Ala His Val Gly 
820 825 830 835 

ata gga ate atg ggt aaa gaa gga aga cag get gca aga aac agt gac 2719 
He Gly He Met Gly Lys Glu Gly Arg Gin Ala Ala Arg Asn Ser Asp 
840 845 850 

tat gca ata gec aga ttt aag ttc etc tec aaa ttg ctt ttt gtt cat 2767 
Tyr Ala He Ala Arg Phe Lys Phe Leu Ser Lys Leu Leu Phe Val His 
855 860 865 
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ggt cat ttt tat tat att aga ata get acc ctt gta cag tat ttt ttt .2815 
Gly His Phe Tyr Tyr- lie- Arg lie Ala Thr Leu« Val Gin Tyr Phe Phe 
870 875 880 



tat aag, aat gtg tgc ttt ate aca ccc cag ttt tta tat cag ttc tac 
Tyr Lys' Asn Val Cys Phe lie Thr Pro Gin Phe Lep Tyr Gin .Phe Tyr 
885 890 8p5 



Igaa cag cat gta gac cct cat cjtg tta caa aat aag ccc acc ctt tat 
Glu Gin His Val Asp Pro His Val Leu Gin Asn Lys Pro Thr Leu Tyr 
935 940 945 



2863 



tgt ttg ttt tct cag caa aca ttg tat gac age gtg tac ctg act tta 2911 

Cys Leu Phe Ser Gin Gin Thr Leu Tyr Asp Ser Val Tyr Leu Thr Leu 

900 905. '910 ' 915 

• ■ i 1 . 

tac aat att tgt ttt act tec eta cct att ctg ata tat agt ctt ttg. .2959 

Tyr Asn lie Cys Phe Thr Ser Leu Pro lie Leu lie Tyr Ser Leu Leu 

920 , 925 . 930 



3007 



cga gac att agt aaa aac cgc etc tta agt att aaa aca ttt ctt tat 3055 
Arg Asp lie Ser Lys Asn Arg Leu Leu Ser lie Lys Thr Phe Leu Tyr 
950 . 955 960 

tgg acc ate ctg ggc ttc agt cat gec ttt att ttc ttt ttt gga tec 3103 
Trp Thr lie Leu Gly Phe Ser His Ala Phe lie Phe Phe Phe Gly Ser 
965 970 975 . 

tat tta eta ata ggg aaa gat aca. tct ctg ctt gga aat ggc cag atg 3151 
Tyr Leu Leu lie Gly Lys Asp Thr Ser Leu Leu Gly Asn Gly Gin Met 
980 985 990 995 

ttt gga aac tgg . aca ttt ggc act ttg gtc ttc aca gtc atg gtt att 3199 
Phe -Gly Asn Trp Thr Phe Gly Thr Leu Val Phe Thr Val Met Val lie 
1000 1005 1010 

aca gtc aca gta aag atg get ctg gaa act cat ttt tgg act tgg ate 3247 
Thr Val Thr Val Lys Met Ala Leu Glu Thr His- Phe Trp Thr Trp lie 
1015 " 1020 1025 

aac cat etc gtt acc tgg gga tct att ata ttt tat ttt gta ttt tec 3295 
,Asn His Leu Val Thr Trp Gly Ser lie lie Phe Tyr Phe Val Phe Ser 
1030 1035 1040 

ttg ttt "tat gga ggg att etc tgg cca ttt ttg ggc tec cag aat atg 3343 
Leu Phe Tyr Gly Gly lie Leu Trp Pro Phe Leu Gly Ser Gin Asn Met 
1045 ~ 1050 1055 

tat ttt gtg ttt att cag etc ctg tea agt ggt tct get tgg ttt gec 3391 
tyr Phe Val Phe lie Gin Leu Leu Ser Ser Gly Ser Ala Trp Phe Ala 
1060 1065 1070 1075 

ata ate etc atg gtt gtt aca tgt eta ttt ctt gat ate ata aag aag 3439 
lie lie Leu Met Val Val Thr Cys Leu Phe Leu Asp lie lie Lys Lys 
1080 1085 1090 

gtc ttt gac cga cac etc cac cct aca agt act gaa aag gca cag ctt 34 87 
Val Phe Asp Arg His Leu His Pro Thr Ser Thr Glu Lys Ala Gin Leu 
1095 . 1100 1105 
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act gaa aca' aat gea ggt ate aag tgc ttg gac tec atg tgc tgt ttc 
Thr Glu Thr Asn Ala ply Tie iLys Cys Leu Asp Ser Met Cys Cys Phe • 
1110 1115 1120 

ccg gaa gga gaa gca gcg tgt gea tct gpt gga aga atg ctg gaa cga^ 
Pro Glu Gly 'Glu Ala Ala Cys Ala Ser Val Gly Arg Met Leu Glu Arg % 
1125 1130 . 1135V 

gtt ata gga aga tgt agt cca acq cac ate age agt tea tgg agt gca 

Val lie Gly Arg Cys Ser Pro Thr His lie Ser Ser .Ser Trp Ser Ala 

1140. ~~ 114? 1150 1155 

i . ' 

teg gat cct ttc tat ace aac gac agg age ate ttg act etc tec aca 

Ser Asp Pro Phe Tyr Thr Asn Asp Arg Ser lie Leu Thr Leu Ser Thr 

, 1160. 1165 ; 1170 

atg gac tea tct act tgt taa aggggcagta gtactttgtg ggagccagtt 
Met Asp S^r Ser Thr Cys * 
1175 

cacctccttt cctaaaattc agtgtgatca ccctgttaat ggccac'acta 1 get ctgaaat 37 90 
taatttccaa aatctttgta gtagttcata cccactcaga gttataatgg caaacaaaca 3850 
gaaagcatta gtacaagccc ctcccaacac ccttaatttg aatctgaaca tgttaaaatt 3910 
tgagaataaa gagacatttt tcatctcttt gtctggtttg tcccttgtgc ttatgggact 3970 ■ 
cctaatggca tttpagtctg ttgetgagge cattat'attt taatataaat gtagaaaa'aa 4 030 
gagagaaatc ttagtaaagfei gtatttttta gtattagctt gattattga,c tcttctattt 4090 
aaatctgett ctgtaaatta tgctgaaagt ttgecttgag aactctattt ttttattaga 4150 
gttatattta aagcttttca tgggaaaagt taatgtgaat actgaggaat tttggtccct 4210 
cagtgacctg tgttgttaat teattaatge attctgagtt cacagagcaa attaggagaa 4270 
tcatttccaa ccattattta ctgcagtatg gggagtaaat ttataccaat tcctctaact 4 330 
gtactgtaac acagcctgta aagttagcca tataaatgea agggtatatc atatatacaa 4 390 
atcaggaatc aggtcegtte accgaacttc aaattgatgt ttactaatat ttttgtgaca 4450 
gagtataaag accctatagt gggtaaatta gatactatta gcatattatt aatttaatgt 4510 
ctttatcatt ggatcttttg catgetttaa tctggttaac atatttaaat ttgctttttt 4570 
tctctttacc tgaaggctet gtgtatagta tttcatgaca tcgttgtaca gtttaactat 4630 
caataaaaag tttggacagt atttaaatat tgcaaatatg tttaattata caaatcagaa 4690 
tagtatgggt aattaaatga atacaaaaag aagagectet ttctgcagcc gacttagaca 4750 
tgctcttccc tttctataag ctagatttta gaataaaggg tttcagttaa taatcttatt 4810 
ttcaggttat gtcatctaac ttatagcaaa ctaccacaat acagtgagtt ctgccagtgt 4 870 
cccagtacaa ggcatatttc aggtgtggct gtggaatgta aaaatgetea acttgtatca 4 930 
ggtaatgtta gcaataaatt aaatgctaag aatgattaat egggtacatg ttactgtaat 4 990 
taactcattg cacttcaaaa cctaacttcc atcctgaatt tatcaagtag ttcagtattg 5050 
tcatttgttt ttgttttatt gaaaagtaat gttgtcttaa gatttagaag tgattattag 5110 
cttgagaact attacccagc tctaagcaaa taatgattgt. atacatatta agataatggt 5170 
taaatgcggt tttaccaagt tttcccttga aaatgtaatt cctttatgga gatttattgt 5230 
gcagccctaa gcttccttcc cattteatga atataaggct tctagaattg gactggcagg 5290 
ggaaagaatg gtagagacag aaattaagac tttatccttg tttgcttgta aactattatt 5350 
ttcttgctaa tgtaacattt gtctgttcca gtgatgtaag gatattaagt tattaagcta 5410 
aatattaatt ttcaaaaata gtccttcttt aacttagata tttcatagct ggatttagga 5470 
agatctgtta ttctggaagt actaaaaaga ataatacaac gtacaatgtc tgcattcact 5530 
aattcatgtt ccagaagagg aaataatgaa gatatactca gtagagtact aggtgggagg 5590 
atatggaaat ttgctcataa aatctcttat aaaacgtgca tataacaaaa tgacacccag 5650 
taggectgea ttacatttac atgaccgtgt ttatttgeca tcaaataaac tgagtactga 5710 
caccagacaa agactccaaa gtcataaaat agectatgae caactgcagc aagacaggag 5770 
gtcagetege ctataatggt gcttaaagtg tgattgatgt aattttctgt actcaccatt 5830 
tgaagttagt taaggagaac tttatttttt taaaaaaagt aaatggcaac cactagtgtg 5890 
ctcatcctga actgttactc caaatccact ccgtttttaa agcaaaatta tcttgtgatt 5950 
ttaagaaaag agttttctat ttatttaaga aagtaacaat geagtctgea agctttcagt 6010 
agttttctag tgetatatte atcctgtaaa actcttacta cgtaaccagt aatcacaagg 6070 
aaagtgtccc etttgeatat ttctttaaaa ttctttcttt ggaaagtatg atgttgataa 6130 
ttaacttacc cttatctgcc aaaaccagag caaaatgeta aatacgttat tgctaatcag 6190 
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tggtctcaaa 
ggcaagcacc 
tgcctcagct- 
actacagcta 
ttctaactgt 
gccacaccca 
tacttgcatt 
gtgagattga 
aaaattgaca 
ggctgtagga 
gtatgatata 
tattaatttt 
ttcataaaag 
ctataaaaaa 
gaggaatgag 
actgtggctg 
ttattgtaca 
ctttcctata 



•tcgatttgcc 
aagtcagttt 
tcagcaggcg 
cgaagacctg 
cctctttctt 
ggccctatcc 
ctgccctacg 
acttatctga 
gttaagagcc 
aaagggagaa 
agcaggtatt 
ggtccccacg 
taaaacacta 
tagatcataa 
Jzatctggaaa 
tctaatgtaa 
gctactcata 
t 



tccctttgcc 
ccaaaa^tgc 
taggctcacc 
cXgt^gagt"r 
gggtctaaag 
tgaacaggag 
gttagtacca 
tcgcttgaga 
aaaagttttt 
tcttccatgt 
a.ataaaaata 
taca^acatt 
ttaaagtgct 
ctcatgatat 
tattgtagca 
tcctttaaaa 
attttttaaa 



tcgtctgagg 
ccct9agctg 
ctgggcggag 

ctcataatac 

actaaacaga 

ggactgaggt 

ctcctaatag 

aaaatatgct 

tggtgttttt 

aeacaccaaa t 

ttatttctat 

gttttatgtg 

gtttgtaatc 

atacttggtt 

attctctgca 

gtttatgaag 



gctgtaagcc 

ctttaagtga . 

caaagtatgg 

gayaatttaL 

acaaaggctt 

ggcaaatcaa 

catttctact 

gcaggagtca 

actctgaaaa 

cctgtaaaga 

gagttacgta 

tttgaaatga 

aaataacttg 

atggtaattt 

taaaattttg 

ttgtcagtaa 

ttatatttat 



tgaagatagt 

ctcagca.ccc 

gccagggaga 

yy Lcly^dlL 

ccagacctga 

ccctaggaaa 

ggaaaagatt 

aggccactag 

atctcgtgaa 

tcagtttggg 

aaacatgttt' 

gttatctatt 

aatgttgttc 

agatttttat 

gacctgagac 

atgtagtata 

caaataaaaa 



625a 
6310. 
6370- 
-6^0— 
64 90 
6550 
6610 
6670 
6730 
6790 . 
6850 
6910 
6970 
703.0 
7090 
7150 
7210 
7 221 



<210> 20 * 
<211> 1177 
<212> PRT 

<213> Homo sapiens » ( 

<400> 20 - 1 ..• 

Met Trp Arg Trp lie Arg Gin Gin Leu Gly Phe Asp Pro Pro His Gin 

1 5 10 15 

Ser Asp Thr Arg Thr He Tyr Val Ala His Arg Phe Pro Gin Asn Gly 

20 25 30 

Leu Tyr Thr Pro Gin Lys Phe He Asp Asn Arg He He Ser Ser Lys 

35 ' 40 . 45 

Tyr Thr Val Trp Asn Phe Val Pro Lys Asn Leu Phe Glu Gin Phe Arg 

50 55 60 

Arg Val Ala Asn Phe Tyr Phe Leu He He Phe Leu Val Gin Leu Met 
65 70 75 80 

He Asp Thr Pro Thr Ser Pro Val Thr Ser Gly Leu Pro Leu Phe Phe 

85 90 ' . 95 

Val He Thr Val Thr Ala lie Lys Gin Gly Tyr Glu Asp Trp Leu Arg 

100 105 110 

His Asn Ser Asp Asn Glu Val Asn Gly Ala Pro Val Tyr Val Val Arg 

115 120 125 

Ser Gly Gly Leu Val Lys Thr Arg Ser Lys Asn lie Arg Val Gly Asp 

130 135 . 140 

He Val Arg He Ala Lys Asp Glu lie Phe Pro Ala Asp Leu Val Leu 
145 150 155 160 

Leu Ser Ser Asp Arg Leu Asp Gly Ser Cys His Val Thr Thr Ala Ser 

165 170 175 

Leu Asp Gly Glu Thr Asn Leu Lys Thr His Val Ala Val Pro Glu Thr 

180 185 190 

Ala Leu Leu Gin Thr Val Ala Asn Leu Asp Thr Leu Val Ala Val He 

195 200 205 

Glu Cys Gin Gin Pro Glu Ala Asp Leu Tyr Arg Phe Met Gly Arg Met 

210 215 220 

lie He Thr Gin Gin Met Glu Glu lie Val Arg Pro Leu Gly Pro Glu 
225 230 235 240 

Ser Leu Leu Leu Arg Gly Ala Arg Leu Lys . Asn Thr Lys Glu He Phe 

245 ^ 250 255 

Gly Val Ala Val Tyr Thr Gly Met Glu Thr Lys Met Ala Leu Asn Tyr 
260 . 265 270 
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Lys 


Ser 


Lys 


Ser Gin 


Lys Arg 


Ser 


Ala 


Val 


Glu Lys 


Ser 


Met 


Asn 


Thr 






275 








280 






• 




285 








Phe 


Leu 


He 


He Tyr Leu Val 


lie 


Leu 


lie 


Ser 


Glu 


Ala 


Val 


He 


Ser 




290 








295 








. ,300 


i 








Thr 


He, Leu Lys Tyr 


Thr Trp. Gin Ala Glu Glu Lys 


Trp As*p 


Glu 


Pro 


305 








310 










315- 










320 


Trp 


Tyr 


Asn 


Gin Lys 


Thr 


Glu 


His 


Gin Arg 


Asn 


Ser 


Ser 


Lys 


He 


Leu 








325 










330 










335 




Arg 


Phe 


He 


Ser Asp 
340 


Phe 


Leu 


Ala 


Phe 
345, 


Leu 


Val 


Leu 


Tyr 


Asn 
350 


Phe 


lie 


He 


Pro 


He 


Ser Leu 


Tyr Val 


Thr 


Val 


Glu 


Met 


Gin 


Lys 


Phe 


Leu 


Gly 






355 








360 










365 








Ser 


Phe 


Phe 


He Gly 


Trp Asp 


Leu Asp Leu Tyr 


His 


Glu 


Glu 


Ser 


Asp 




370 








375 










380 










Gin 


Lys 


Ala 


Gin. Val 


Asn 


Thr 


Ser Asp 


Leu 


Asn 


Glu 


Glu 


Leu 


Gly 


Gin 


385 








390 










395 










400 


j. 

Va^ 


Glu 


Tyr 


Val Phe 


Thr Asp tys Thr Gly Thr 


Leu 


Thr 


Glu 


Asn 


Glu 








405 










410 










415 




Met 


Gin 


Phe 


Arg Glu 


Cys 


Ser 


lie 


Asn 


Gly Met 


Lys 


Tyr 


Gin 


Glu 


lie 








4 20 








425 










430 






As'n 


Gly Arg 


Leu Val 


Pro 


Glu 


Gly 


Pro 


Thr 


Pro Asp 


Ser 


Ser 


Glu 


Gly 






435 








440 










4 45 








Asn 


Leu 
450 


Ser 


Tyr Leu 


Ser 


Ser 
455 


Leu 


Ser 


His 


Leu 

• 


Asn 
460 


Asn 


Leu 


Ser 


His 


Leu 


Thr 


Thr 


Ser Ser 


Ser 


Phe 


Arg 


Thr 


Ser 


Pro 


Glu 


Asn 


Glu 


Thr 


Glu 


4 65 








470 










475 










480 


Leu 


lie 


Lys 


Glu His 
485 


Asp 


Leu 


Phe 


Phe 


Lys 
490 


Ala 


Val 


Ser 


Leu 


Cys 
495 


His 


Thr 


Val 


Gin 


He Ser 


Asn Val 


Gin 


Thr 


Asp 


Cys 


Thr 


Gly Asp 


Gly 


Pro 








500 








505 










510 






Trp 


Gin 


Ser 


Asn Leu 


Ala 


Pro 


Ser 


Gin 


Leu 


Glu 


Tyr 


Tyr 


Ala 


Ser 


Ser 






515 








520 










525 






Val 


Pro 


Asp 


Glu 


Lys Ala 


Leu 


Val 


Glu 


Ala Ala 


Ala 


Arg 


He 


Gly 


He 




530 








535 










540 










Phe 


lie 


Gly Asn Ser 


Glu 


Glu 


Thr 


Met 


Glu 


Val 


Lys 


Thr 


Leu 


Gly 


Lys 


545 








550 










555 










560 


Leu 
i 


Glu 


Arg 


Tyr Lys 
565 


Leu 


Leu 


His 


He 


Leu 
570 


Glu 


Phe 


Asp 


Ser 


Asp 
575 


Arg 


Arg 


Arg 


Met 


Ser Val 
580 


He 


Val 


Gin 


Ala 

585 


Pro 


Ser 


Gly 


Glu 


Lys 
590 


Leu 


Leu 


Phe 


Ala 


Lys 
595 


Gly Ala 


Glu 


Ser 


Ser 
600 


He 


Leu 


Pro 


Lys 


Cys 
605 


He 


Gly 


Gly 


Glu 


He 


Glu Lys Thr 


Arg 


He 


His 


Val 


Asp 


Glu 


Phe 


Ala 


Leu 


Lys 


Gly 




610 








615 










.620 










Leu 


Arg 


Thr 


Leu Cys 


He 


Ala 


Tyr 


Arg 


Lys 


Phe 


Thr 


Ser 


Lys 


Glu 


Tyr 


625 








630 










635 










640 


Glu 


Glu 


He 


Asp Lys 
645 


Arg 


He 


Phe 


Glu 


Ala 
650 


Arg 


Thr 


Ala 


Leu 


Gin 
655 


Gin 


Arg 


Glu 


Glu 


Lys Leu 


Ala 


Ala 


Val 


Phe 


Gin 


Phe 


He 


Glu 


Lys 


Asp 


Leu 






660 








665 










670 






He 


Leu 

• 


Leu 
675 


Gly Ala 


Thr 


Ala 


Val 
680 


Glu 


Asp 


Arg 


Leu 


Gin 
685 


Asp 


Lys 


Val 


Arg 


Glu 


Thr 


He Glu 


Ala 


Leu Arg Met 


Ala Gly, He 


Lys 


Val 


Trp 


Val 




690 








695 










700 










Leu 


Thr Gly Asp Lys 


His 


Glu Thr 


Ala 


Val 


Ser 


Val 


Ser 


Leu 


Ser 


Cys 


705 








710 










715 










720 


Gly 


His 


Phe 


His Arg 
725 


Thr 


Met 


Asn 


He 


Leu 
730 


Glu 


Leu 


He 


Asn 


Gin 
735 


Lys 


Ser 


Asp 


Ser 


Glu Cys 
740 


Ala 


Glu 


Gin 


Leu 
745 


Arg 


Gin 


Leu 


Ala 


Arg 
750 


Arg 


He 
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Thr 


Glu 


Asp 


His 


Val 


lie 


Gin 


His 


Gly 


Leu 


Val 


Val. Asp 


Gly 


Thr 


Ser 






755 










.7 60 






• i 




765 








Leu 


Ser 


Leu 


Ala 


Leu 


Arg 


Glu 


His 


Glu 


Lys 


Leu 


Phe 


Met 


Glu 


Val 


Cys 




770 










775 




■ 






780 










Arg 


Asn 


Cys 


Ser 


Ala 


Val 


Leu 


Cys 


Cys 


Arg 


Met 


Ala 


Pro 


Leu 


Gin 


Lys 


785 










790 










7 95 










800 


Ala 


Lys 


Val 


He 


Arg 


Leu 


He 


Lys 


He 


Ser 


Pro Glu 


Lys 


Pro 


He 


Thr 








805 










810 










815 




Leu 


Ala 


Val 


Gly Asp .Gly Ala Asn Asp- 


Val 


Ser 


Met 


He 


Gin 


Glu Ala 








820 










82£ 










830 






His 


Val 


Gly 


He 


Gly lie Met Gly Lys Glu Gly Arg 


Gin 


Ala 


Ala 


Arg 






835 










8.40 






i 




846 








Asn 


Ser 


Asp 


Tyr Ala He Ala. Arg Phe Lys Phe 


Leu 


Ser 


Lys 


Leu 


Leu 




850 










855 










8 60 










Phe 


Val 


His 


Gly His 


Phe 


Tyr 


Tyr 


lie 


Arg 


He 


Ala 


Thr 


Leu 


Val 


Gin 


865 










870 










875 










880 : 


'Tyr 


Phe 


Phe 


Tyr 


Lys 


Asn 


Val 


Cys 


Phe 


He 


Thr 


Pro 


Gin 


Phe 


Leu 


Tyr 








885 








890 










895 




Gin 


Phe 


Tyr 


Cys 


Leu 


Phe 


Ser 


Gin 


Gin 


Thr 


Leu : Tyr 


Asp 


Ser 


Val 


Tyr 






900 










905 










910 






Leu 


Thr 


Leu 
915 


Tyr 


Asn 


lie 


Cys 


Phe 
920 


Thr 


Ser 


Leu 


Pro 


He 
925 


Leu 


He. 


Tyr 


Ser 


Leu 
930 


Leu 


Glu 


Gin 


His 


Val 
935 


Asp 


Pro 


His 


Val 


Leu 
940 


Gin 

• 


Asn 


Lys 


Pro 


Thr 


Leu 


Tyr 


Arg 


Asp 


He 


Ser 


Lys 


Asn 


Arg 


Leu 


Leu 


Ser 


lie 


Lys 


Thr 


945 








950 










955 










960 


Phe 


Leu 


Tyr 


Trp 


Thr 
965 


He 


Leu 


Gly 


Phe 


Ser 
970 


His 


Ala 


Phe 


He 


Phe 
97 5 


Phe 


Phe 


Gly 


Ser 


Tyr 


Leu 


Leu 


lie 


Gly 


Lys 


Asp 


Thr 


Ser 


Leu 


Leu 


Gly 


Asn 






980 










985 










990 






Gly Gin 


Met 


Phe 


Gly 


Asn 


Trp 


Thr 


Phe 


Gly Thr 


Leu 


Val 


Phe 


Thr 


Val 






995 










1000 








1005 






Met 


Val 


He 


Thr 


Val 


Thr 


Val 


Lys 


Met 


Ala 


Leu 


Glu 


Thr 


His 


Phe 


Trp 




1010 








1015 








1020 








Thr 


Trp 


lie 


Asn 


His 


Leu 


Val 


Thr 


Trp Gly 


Ser 


lie 


He 


Phe 


Tyr 


Phe 


1025 








1030 








1035 








1040 


Val 


Phe 


Ser 


Leu 


Phe 


Tyr 


Gly 


Gly 


He 


Leu 


Trp 


Pro 


Phe 


Leu 


Gly 


Ser 










1045 








1050 








1055 


Gin 


Asn 


Met 


Tyr 


Phe 


Val 


Phe 


He 


Gin 


Leu 


Leu 


Ser 


Ser 


Gly 


Ser 


Ala 








1060 








1065 








1070 




Trp 


Phe 


Ala 


He 


lie 


Leu 


Met 


Val 


Val 


Thr 


Cys 


Leu 


Phe 


Leu 


Asp 


He 




1075 








1080 








1085 






'lie 


Lys 


Lys 


Val 


Phe 


Asp 


Arg 


His 


Leu 


His 


Pro 


Thr 


Ser 


Thr 


Glu 


Lys 




1090 








1095 








1100 








Ala 


Gin 


"Leu 


Thr 


Glu 


Thr 


Asn Ala Gly 


lie 


Lys 


Cys 


Leu Asp Ser 


Met 



1105 1110 1115 1120 

Cys Cys Phe Pro Glu Gly Glu Ala Ala Cys Ala Ser Val Gly Arg Met 

1125 1130 1135 

Leu Glu Arg Val He Gly Arg Cys Ser Pro Thr His lie Ser Ser Ser 

1140 ~ ~ 1145 1150 

Trp Ser Ala Ser Asp Pro Phe Tyr Thr Asn Asp Arg Ser He Leu Thr 

1155 1160 1165 

Leu Ser Thr Met Asp Ser Ser Thr Cys 
1170 1175 



<210> 21 

<211> 3534 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> CDS • |t " , . , , 

<222> (1) . . . (3534) - - 

<400> 21 ' 1 " * 9 

atg tgg cgc tgg ate egg cag cag ctg'tjgt ttt gac cca cca cat cag 48 
Met Trp Arg Trp lie Arg Gin Gin Leu Gly Phe Asp' Pro Pro His Gin 
1 5 10 15 

agt gac aca aga acc ate tac gta gec cac agg ttt ( cct cag aat ggc 96 
Ser Asp Thr Arg Thr lie Tyr Val : Ala His Arg Phe Pro Gin Asn Gly 1 
20 . 25 ' . 30 , 

ctt tac aca cct, cag aaa ttt ata gat aac agg ate att tea tct aag 144 
Leu Tyr Thr Pro Gin Lys Phe lie Asp Asn Arg lie He S,er Ser Lys 

3^ ,40 45 

i . 

tac act gtg tgg aat ttt gtt cca aaa aat tta ttt gaa cag ttc aga 192 
Tyr Thr Val Trp Asn Phe Val Pro Lys- Asn Leu Phe Glu Gin Phe Arg 
.50 55 .60 

aga gtg gca aac ttt tat ttt ctt att ata ttt ttg gtt cag ctt atg 240 

Arg Val Ala Asn. Phe Tyr Phe Leu He He Phe Leu Val Gin Leu Met 

65 70 , 1 ' 75 80 

' . ' . ' ' i 

att gat aca cbt acc. agt cca gtt acc agt gga ctt cca tta ttc ttt 288 

He Asp Thr Pro Thr Ser Pro Val Tfir Ser Gly Leu Pro 'Leu Phe Phe 

85 90 95 

gtg ata aca gta act gec ata aag cag gga tat gaa gat tgg tta egg 336 
Val lie Thr Val Thr Ala He Lys Gin Gly Tyr Glu Asp Trp Leu Arg 
100 ■ 105 • 110 

cat aac tea gat aat gaa gta aat gga get cct gtt tat gtt gtt cga 384 
His Asn Ser Asp Asn Glu Val Asn Gly Ala Pro Val Tyr- Val Val Arg 
115 120 125 

agt ggt ggc ctt gta aaa act aga tea aaa aac att egg gtg ggt gat 4 32 
Ser Gly Gly Leu Val Lys Thr Arg Ser Lys Asn lie Arg Val Gly Asp 
130 135 • .140 

att gtt cga ata gee aaa gat gaa att ttt cct gca gac ttg gtg ctt 480 
He Val Arg lie Ala Lys Asp Glu He Phe Pro Ala Asp Leu Val Leu 
145 150 155 160 

ctg tec tea gat cga ctg gat ggt tec tgt cac gtt aca act get agt 528 
Leu Ser Ser Asp Arg Leu Asp Gly Ser Cys His Val Thr Thr Ala Ser 
165 170 175 

ttg gac gga gaa act aac ctg aag aca cat gtg gca gtt cca gaa aca 57 6 
Leu Asp Gly Glu Thr Asn Leu Lys Thr His Val Ala Val Pro Glu Thr 
180 185 190 

gca tta tta caa aca gtt gee aat. ttg gac act eta gta get gta ata 624 
Ala Leu Leu Gin Thr Val Ala Asn Leu Asp Thr Leu Val Ala Val He 
195 200 205 

gaa tgc cag caa cca gaa gca gac tta tac aga ttc atg gga cga atg 672 
Glu Cys Gin Gin Pro Glu Ala Asp Leu Tyr Arg Phe Met Gly Arg Met 
210 215 220 
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ate ata ace caa caa atg gaa gaa att gta aga. cct ctg ggg ccg gag 720 . 

-lie lie Thr Gin Gin ^et Glu ,Glu lie Val Arg Pro- Leu Gly Pro G\u . 

225 230 235 ?40 

: i s , r 

agt etc ctg ctt cgt gga gee aga tta aaa aa.c aca aaa gaa att ttt 

Ser Leu Leu'Leu Arg Gly Ala Arg Leu Lys Asn Thr LyS Glu lie Phe 

245 250. ".■ 255 



768 



816 



ggt gtt gcg gta tac act gga atg gaa act aag atg gca tta aat tac 

Gly Val Ala Val Tyr Thr Gly Met Glu Thr Lys Met ,Ala Leu Asn Tyr 
260 , 265 , 270 

, ...... 

aag age aaa tea cag aaa cga tct gca gta gaa aag tea atg aat. aca 864 

Lys Ser Lys Ser 'Gin Lys Arg Ser Ala Val Glu Lys Ser Met Asn Thr 
275, . 280 285 
' i . » 

ttt ttg ata att tat eta gta att ctt ata tct gaa get gtc ate age 912 

Phe Leu IJLe lie Tyr Leu Val lie Leu lie Ser Glu Ala Val He Ser 
290 295 300 

act ate ttg aag tat aca tgg caa get gaa gaa aaa tgg gat gaa cct 960 

Thr He Leu Lys Tyr Thr Trp Gin Ala Glu Glu Lys Trp Asp Glu Pro 

305 310 ( / 315 320 

• . 

tgg tat aac caa aaa aca gaa cat caa aga aat age ' agt aag att et^ 1008 

Trp Tyr Asn Gin Lys 'Thr Glu His Gin Arg Asn Ser Ser Lys He Leu 

325 . 330 335 

aga ttt att tea gac ttc ctt get ttt ttg gtt etc tac aat ttc. ate 1056 

Arg Phe He Ser Asp Phe Leu Ala Phe Leu Val Leu Tyr Asn Phe He 
34 0 34 5 350 

i 

att cca att tea tta tat gtg aca gtc gaa atg cag aaa ttt ctt gga 1104 

He Pro He Ser Leu Tyr Val Thr Val Glu Met Gin Lys Phe Leu Gly 

355 360 365 

tea ttt ttt att ggc tgg gat ctt gat ctg tat cat gaa gaa tea gat 1152 
Ser Phe Phe He Gly Trp Asp Leu Asp Leu Tyr His , Glu Glu Ser Asp 
370 375 380 ' 

cag aaa get caa gtc aat act tec gat ctg aat gaa gag ctt gga cag 1200 
Gin Lys Ala Gin Val Asn Thr Ser Asp Leu Asn Glu Glu Leu Gly Gin 
385 390 395 400 

gta gag tac gtg ttt aca gat aaa act ggt aca ctg aca gaa aat gag 124 8 
Val Glu Tyr Val Phe Thr Asp Lys Thr Gly Thr Leu Thr Glu Asn Glu 
4 05 410 415 

atg cag ttt egg gaa tgt tea att aat ggc atg aaa tac caa gaa att 1296 
Met Gin Phe Arg Glu Cys Ser He Asn Gly Met Lys Tyr Gin Glu He 
420 425 430 

aat ggt aga ctt gta ccc gaa gga cca aca cca gac tct tea gaa gga 134 4 
Asn Gly Arg Leu Val Pro Glu Gly Pro Thr Pro Asp Ser Ser Glu Gly 
435 440 445 

aac tta tct tat ctt agt agt tta tec cat. ctt aac aac tta tec cat 1392 
Asn Leu Ser Tyr Leu Ser Ser Leu Ser. His Leu Asn Asn Leu Ser His 
450 455 460 
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ctt aca acc agt tec tct ttc aga acc agt cct gaa aat gaa. act gaa .1440 
Leu Thr Thr Ser Ser -Ser Phe Arg Thr Ser Pro, Glu Asn Glu . Thr- 'Glu 
465 - - 470 \ 475 . 480 

eta att aaa gaa cat gat etc ttc ttt aaa.gca gtc agt etc tgt cac 1488 , 
Leu He Lys Glu His Asp Leu Phe Phe Lys Ala Val Ser Leu Cys His 
485 490 : ■ 495 

act gta cag att age aat gtt caa act gac tgc act ggt gat ggt ccc 1536 
Thr Val Gin He Ser Asn Val Gin Thr Asp Cys Thr Gly Asp Gly Pro 

500 505 ' 5i0 

■ t ■ 

... • ' i ' ■ 

tgg caa tec aac ctg gca cca teg cag ttg gag tac tat gca tct tea 1584 

Trp Gin Ser Asn Leu Ala Pro Ser Gin Leu Glu Tyr Tyr Ala Ser Ser 

515 , 520 525 , 

•cca gat gaa aag get eta gta gaa get get gca' agg att ggt att gtg 1632 
Pro Asp Glu Lys Ala Leu Val 'Glu Ala Ala Ala Arg lie Gly lie Val 
530 * 535 540 

ttt att ggc aat tct gaa gaa act atg gag gtt aaa act ctt gga aaa 1680 
Phe He Gly Asn Ser Glu Glu Thr Met Glu Val Lys Thr Leu Gly Lys 
545 550 555 560 

ctg gaa egg tac aaa ctg ctt cat att ctg gaa ttt gat tea gat cgt 1728 
Leu Glu Arg Tyr Lys Leu Leu His lie Leu Glu Phe Asp Ser Asp Arg 
565 570 , 575 

agg aga atg agt gta att gtt cag gca cct tea ggt gag aag tta tta 177 6 
Arg Arg Met Ser Val He Val Gin Ala Pro Ser Gly Glu Lys Leu Leu 
580 585 590 

ttt get aaa gga get gag tea tea att etc cct aaa tgt ata ggt gga 1824 
Phe Ala Lys Gly Ala Glu Ser Ser lie Leu Pro Lys Cys He Gly Gly 
595 600 605 

gaa ata gaa aaa acc aga att cat gta gat gaa ttt get ttg aaa ggg 1872 
Glu lie Glu Lys Thr Arg He His Val Asp Glu Phe Ala Leu Lys Gly 
' 610 615 620 



eta aga act ctg tgt ata gca tat aga aaa ttt aaa tea aaa gag tat 
Leu Arg Thr Leu Cys He Ala Tyr Arg Lys Phe Thr Ser Lys Glu Tyr 
625 630 635 640 



1920 



gag gaa "ata gat aaa cgc ata ttt gaa gee agg act gee ttg cag cag 1968 

Glu Glu He Asp Lys Arg He Phe Glu Ala Arg Thr Ala Leu Gin Gin 

645 650 655 

egg gaa gag aaa ttg gca get gtt ttc cag ttc ata gag aaa gac ctg 2016 

Arg Glu Glu Lys Leu Ala Ala Val Phe Gin Phe He Glu Lys Asp Leu 
660 665 670 

i 

ata tta ctt gga gee aca gca gta gaa gac aga eta caa gat aaa gtt 2064 

He Leu Leu Gly Ala Thr Ala Val Glu Asp Arg Leu Gin Asp Lys Val 

675 680 685 

cga gaa act att gaa gca ttg aga atg get ggt ate aaa gta tgg gta 2112 

Arg Glu Thr He Glu Ala Leu Arg Met Ala Gly He Lys Val Trp Val 

690 695 700 
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ctt act ggg gat aaa cat gaa aca get gtt agt gtg agt tta tea tgt 2160 
Leu Thr Gly Asp Lys His <eiu„Thr Ala Val Ser Val Ser Leu Ser Cys. 
705 . " * 7 ^ Q 715 72Q 

ggc cat ttt cat agra acc atg aac ate ctt gaa ctt ata aac cag'aaa 2208 
Gly His Phe> His Arg Thr Met Asn lie 'Leu Glu Leu lie Asn Gin Lys . 

725 730, ' ' 735 

tea gac age gag tgt get gaa caa ttg agg cag ctt gee aga aga att 2256 
Ser Asp Ser Glu Cys Ala Glu Gin Leu Arg Gin Leu, Ala Arg Arg lie 
740 ' 745 750 

aca gag gat cat gtg att cag cat ggg ctg gta gtg gat ggg acc age 2304 
Thr Glu Asp His »Val He Gin His Gly Leu Val Val Asp Gly Thr Ser 
755; , 760 765 . 

eta tct ctt gca etc agg gag cat gaa aaa eta ttt atg gaa gtt tgc 2352 
Leu Ser I^eu Ala Leu Arg Glu His Glu Lys Leu Phe Met Glu Val Cys 
770 * 775 • 780 

aga aat tgt tea get gta tta tgc tgt qgt atg get cca ctig cag aaa 2400 

Arg Asn Cys Ser Ala Val Leu Cys Cys Arg Met Ala Pro Leu Gin Lys 

785 790 | t ■ 795 * 800 

> . 

gca aaa gta ata aga eta ata aaa ata tea cct gag aaa cet ata aca 2448 

Ala Lys Val lie Arg Leu He Lys He Ser Pro Glu Lys Pro He Thr 

805 . 810 . ' 815. 

ttg get gtt ggt gat ggt get aat gac gta age atg ata caa gaa gec 2496 
Leu Ala Val Gly Asp Gly Ala Asn Asp Val Ser Met He Gin Glu Ala 
820 825 830 

cat gtt ggc ata gga ate atg ggt aaa gaa gga aga' cag get gca aga 2544 
His Val Gly lie Gly He Met Gly Lys Glu Gly Arg Gin Ala Ala Arg 
835 840 845 

aac agt gac tat gca ata gee aga ttt aag ttc etc tec aaa ttg ctt 2592 
Asn Ser Asp Tyr Ala He Ala Arg Phe Lys Phe Leu, Ser Lys Leu Leu 
850 855 860 ' 

ttt gtt cat ggt cat ttt tat tat att aga ata get acc ctt gta cag 2640 
Phe Val His Gly His Phe Tyr Tyr He Arg He Ala Thr Leu Val Gin 
865 870 875 880 

tat ttt ttt tat aag aat gtg tgc ttt ate aca ccc cag ttt tta tat 2688 
Tyr Phe Phe Tyr Lys Asn Val Cys Phe lie Thr Pro Gin Phe Leu Tyr 
885 ' 890 895 

cag ttc tac tgt ttg ttt tct cag caa aca ttg tat gac age gtg tac 2736 . 
Gin Phe Tyr Cys Leu Phe Ser Gin Gin Thr Leu Tyr Asp Ser Val Tyr 
900 905 910 

ctg act tta tac aat att tgt ttt act tec eta cct att ctg ata tat 2784 
Leu Thr Leu Tyr Asn lie Cys Phe Thr Ser Leu Pro He Leu He Tyr 
915 920 925 

agt ctt ttg gaa cag cat gta gac cct cat gtg tta caa aat aag ccc 2832 
Ser Leu Leu Glu Gin His Val Asp Pro His Val Leu Gin Asn Lys Pro 
930 935 94 0 



•76- 



02055701 A2 I > 



WO 02/055701 



PCT/US01/49060 



acc ctt tat cga gac att agt aaa aac cgc etc tta agt .att- aaa aca 
Thr Leu Tyr- Arg Asp lie Ser Lys Asn Arg Leu Leu Ser lie Lys Thr 
945 ■ • 950 955 960 



2880 



ttt ctt tat tgg acc ate ctg ggc ttc agt cat gec ttt att ttc ttt 
Phe Leu Tyr Trp Thr lie Leu Gly Phe Ser His Ala Phe lie . Phe Phe 
965 970 ' . 975 



2928 



ttt gga tec tat tta eta ata ggg aaa gat aca tct ctg ctt gga aat 
Phe Gly Ser Tyr Leu Leu lie Gly Lys Asp Thr . Ser Leu Leu Gly Asn 
980 985 ' 990 



2976 



ggc cag atg ttt gga aac tgg aca ttt ggc act ttg gtc ttc aca gtc 
Gly Gin Met Phe Gly Asn Trp Thr Phe Gly Thr Leu Val Phe Thr Val 



995 



1000 



1005 



3024 



»atg gtt att aca gtc aca gta aag atg get ctg gaa act cat ttt tgg 
Met Val lie Thr Val Thr Val Lys Met Ala Leu Glu Thr His Phe Trp 
1010 1015 1020 



3072 



act tgg ate aac cat etc gtt acc tgg gga tct att ata ttt tat ttt 
Thr Trp lie Asn His Leu Val Thr Trp Gly Ser lie lie Phe Tyr Phe 
1025 * 1030 . 1035 ~' .1040 



3120 



gta ttt tec ttg ttt tat gga ggg att etc tgg cca ttt ttg ggc tec 3168 
Val Phe Ser Leu Phe Tyr Gly Gly lie Leu Trp Pro Phe Leu Gly Ser ■ 
1045 " 1050 , 1055 



cag aat atg tat ttt gtg ttt att cag etc ctg tea agt ggt tct get 
Gin Asn Met Tyr Phe Val Phe lie Gin Leu Leu Ser Ser Gly Ser Ala 
1060 10 65 . 1070 



3216 



tgg ttt gee ata ate etc atg gtt gtt aca tgt eta ttt ctt gat ate 
Trp Phe Ala lie lie Leu Met Val Val Thr Cys Leu Phe Leu Asp lie 
1075 1080 1085 



3264 



ata aag aag gtc ttt gac cga cac etc cac cct aca agt act gaa aag 
lie Lys Lys Val Phe Asp Arg His Leu His Pro Thr Ser Thr Glu Lys 
' 1090 1095 1100 



3312 



gca cag ctt act gaa aca aat gca ggt ate aag t^c ttg gac tec atg 
Ala Gin Leu Thr Glu Thr Asn Ala Gly lie Lys Cys Leu Asp Ser Met 
' 1105 1110 1115 1120 



3360 



tgc tgt" ttc ccg gaa gga gaa gca gcg tgt gca tct gtt gga aga atg 
Cys Cys Phe Pro Glu Gly Glu Ala Ala Cys Ala Ser Val Gly Arg Met 
1125 " 1130 1135 



34 08 



ctg gaa cga gtt ata gga aga tgt agt cca acc cac ate age agt tea 
Leu Glu Arg Val lie Gly Arg Cys Ser Pro Thr His lie Ser Ser Ser 
1140 1145 1150 



3456 



tgg agt gca teg gat cct ttc tat acc aac gac agg age ate ttg act 
Trp Ser Ala Ser Asp Pro Phe Tyr Thr Asn Asp Arg Ser lie Leu Thr 
1155 " 1160 1165 



3504 



etc tec aca atg gac tea tct act tgt taa 
Leu Ser Thr Met Asp Ser Ser Thr Cys * 
1170 1175 



3534 
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<210> 22 
<211> 4198 
<212> DNA 



<213> Homo sapiens 



<220> 
<221> CDS 
<222> (156) 



. (3410) 



<400> 22 

ggagtcgacc cacgcgtccg cattgagaca atgcctccac aaatacttga tgcaaaattc 60* . 

agtaagacag cacttgttga- atcaccatta tag.tttc.tga caaattgttc tcaaaaaggt 120. 

accagctgga ggatgagtct gcgcatttgg atgaa atg cca eta atg atg tct .173 

Met Pro Leu Met Met Ser 

, • 1 , 5 



'gaa gaa ggc ttt gag aat gag <gaa agt gat tac cac acc tta cca cga 
Glu Glu Gly Phe Glu Asn Glu Glu Ser Asp Tyr His Thr Leu Pro Arg 



10 



15 



20 



221 



gec agg ata atg caa agg aaa aga gga ctg gag tgg ttt. gtc tgt gat 269 
Ala Arg lie Met Gin Arg Lys Arg Gly Leu Glu Trp Phe Val Cys Asp 
25 30 35 

ggc tgg aag ttc etc tgt acc agt tgc tgt ggt tgg ctg ata aat att 317 
Gly Trp Lys Phe Leu Cys Thr Ser Cys Cys Gly Trp Leu lie Asn lie 
40 45 50 . 

tgt cga aga aag aaa gag ctg aaa get cgc aca gta tgg ctt gga tgt 365 
Cys Arg Arg Lys Lys Glu Leu Lys Ala Arg Thr Val Trp Leu Gly Cys 
55 60 65 70 

cct gaa aag tgt gaa gaa aaa cat ccc agg aat tct ata aaa aat caa 413 
Pro Glu Lys Cys Glu Glu Lys His Pro Arg Asn Ser He Lys Asn Gin 
75 " 80 85 

aaa tac aat gtg ttt acc ttt ata cct ggg gtt ttg tat gaa caa ttc 4 61 
Lys Tyr Asn Val Phe Thr Phe He Pro Gly Val Leu Tyr Glu Gin Phe 
90 95 100 

aag ttt ttc ttg aat etc tat ttt eta gtg ata tec tgc tea cag ttt 509 
.Lys Phe Phe Leu Asn Leu Tyr Phe Leu Val He Ser Cys Ser Gin Phe 
105 110 115 

gta ccagca ttg aaa ata ggc tat etc tac acc tac tgg get cct ctg 557 
Val Pro Ala Leu Lys lie Gly Tyr Leu Tyr Thr Tyr Trp Ala Pro Leu 
120 125 130 

gga ttt gtc ttg get gtt act atg aca egg gaa gca att gat gaa ttt 605 
Gly Phe Val Leu Ala Val Thr Met Thr Arg Glu Ala He Asp Glu Phe 
135 140 145 150 

egg cgt ttt cag cgt gac aag gaa gtg aat tea caa eta tat age aag 653 
Arg Arg Phe Gin Arg Asp Lys Glu Val Asn Ser Gin Leu Tyr Ser Lys 
155 160 165 



ctt aca gta aga ggt aaa gtg caa gtt aag agt tea gac ata caa gtt 
Leu Thr Val Arg Gly Lys Val Gin Val Lys Ser Ser Asp He Gin Val 
170 175 180 



701 
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gga gac etc ate ata gtg gaa aag aat caa aga att cca teg cjac atg 74 9. 

Gly Asp Leu He He Val ,Glu tl Lys Asn Gin Arg lie Pro Ser Asp Met, 
185 M 190 '■ 195 

gtg ttt ctt agg act tea gaa aaa gca ggt teg tgt ttt att cga* Act 797 

Val Phe Leu. Arg Thr Ser Glu Lys Ala ' (Sly Ser Cys Phe He Arg Thr 
200 205 210 \ 



gat caa eta gat ggt gaa act gac tgg aag ctg aag gtg gca gtg age 
Asp Gin Leu Asp Gly Glu Thr Asp Trp Lys Leu Lys, Val Ala Val Ser 
215 220. 225 230 



845 



tgc acg caa cag ctg ccg get ctg ggg gac ctt ttt tct ate agt get 893 
Cys Thr Gin Gin. Leu Pro Ala Leu Gly Asp Leu Phe Ser He Ser Ala 
• ( 235 240 245 

tat gtt tat get cag aaa cca caa atg gac att cac agt ttc gaa ggc 941 
Tyr Val Tyr Ala Gin Lys Pro Gin Met Asp lie His Ser' Phe Glu Gly 
250 255- 260 

aca ttt ace agg gaa gac agt gac ccg ccc att cat gaa agt. etc age 989 
Thr Phe Thr Arg Glu Asp Ser Asp Pro Pro lie His Glu Ser Leu Ser 

265 270 275 

» ♦ 

ata gaa aat aca ttg tgg gca age acc att gtt gca* tea 'ggt act gta 1037 
He Glu Asn Thr Leu Trp Ala Ser Thr He Val Ala Ser Gly Thr Val 
280 285 , 290 

ata ggt gtt gtc att tat acc gga aaa gag act cga agt gta atg aac 1085 
He Gly Val Val He Tyr Thr Gly Lys Glu Thr Arg Ser Val Met Asn 
295 ~ 300 305 310 

aca tec aat cca aaa aat aag gtt ggt t'tg ttg gac' ctt gaa etc aat 1133 
Thr Ser Asn Pro Lys Asn Lys Val Gly Leu Leu Asp Leu Glu Leu Asn 
315 320 325 

egg ctg acg aaa gcg eta ttt ttg get tta gtt get ctt tec att gtt 1181 
Arg Leu Thr Lys Ala Leu Phe Leu Ala Leu Val Ala Leu Ser He Val 
330 335 ' 340 

atg gta acc tta caa gga ttt gtg ggt cca tgg tac cgc aat ctt ttt 1229 
Met Val Thr Leu Gin Gly Phe Val Gly Pro Trp Tyr Arg Asn Leu Phe 
345 350 355 

egg ttc ctt etc etc ttt tct tac ate att ccc ata agt ttg cgt gtg 1277 
Arg Phe Leu Leu Leu Phe Ser Tyr lie He Pro lie Ser Leu Arg Val 
360 365 370 

aac ttg gac atg ggc aaa gcg gtg tat gga tgg atg atg atg aaa gat. 1325 
Asn Leu Asp Met Gly Lys Ala Val Tyr Gly Trp Met Met Met Lys Asp. 
375 380 385 390 

gag aac ate cct ggc acg gtc gtt egg acc age act ate cca gag gaa 1373 
Glu Asn He Pro Gly Thr Val Val Arg Thr Ser Thr He Pro Glu Glu 
395 400 405 

ctt ggg cgc ctg gtg tat tta ttg aca gac aaa aca gga acc etc acc 1421 
Leu Gly Arg Leu Val Tyr Leu Leu Thr Asp Lys Thr Gly Thr Leu Thr 
410 415 420 
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cag aat gaa atg ata ttt aag egg ctg eac'ctg. ggc acegtg tec tat 1469 

Gin Asn Glu Met lie, >Phe >iys..Arg Leu His Leu Gly Thr Val Ser Tyr- 
. 425 430 ' 435 

• s • , r 

ggc gec gac acg atg gat gag ate cag •; age cat gtc agg gac tec tac 1517 

Gly Ala Asp» Thr Met Asp Glu lie Gin Ser His Val Afg Asp Ser Tyr . 

440 445 4.50 ' ' 

tea cag atg cag tct caa get ggt gga aac aat act ggt tea act cca 1565 

Ser Gin Met Gin Ser Gin Ala Gly. Gly Asn Asn Thr, Gly Ser Thr Pro' 

455 4 60 4 65 470 '. 

I . 1 ' * ■ • • '' , 

eta aga aaa gee caa tct tea get ccc aaa gtt agg aaa agt gtc agt 1613 ' ■ 

Leu Arg Lys Ala* Gin Ser Ser Ala Pro Lys Val Arg Lys Ser Val Ser 
., 475. 480 - 485 
i ' • 

agt cga ate cat gaa gee gtg aaa gee ate gtg ctg tgt cac aac gtg 1661 

Ser Arg Hie His. Glu Ala Val Lys Ala lie Val Leu Cys His Asn Val 
490 495' 500 

acc ccc gtg tat gag tct egg gee ggc gtt act gag gag act gag ttc 1709 

Thr Pro Val Tyr Glu Ser Arg Ala Gly Veil Thr Glu Glu Thr Glu Phe 
505 , £10 515 

gca gag get gac caa gac ttc agt gat gag aat cgc ace tac cag get 1757 

Ala Glu Ala Asp Gin' Asp Phe Ser Asp Glu Asn Arg Thr Tyr Gin Ala 
520 525 • 530 

tec age ccg gat gag gtc get ctg gtg cag tgg aca gag agt gtg ggc 1805 

Ser Ser Pro Asp Glu Val Ala Leu Val Gin Trp Thr Glu Ser Val Gly 

535 540 545 550 

etc acg ctg gtc age agg gac etc acc tec atg cag ctg aag ace cce 1853 

Leu Thr Leu Val Ser Arg Asp Leu Thr Ser Met Gin Leu Lys Thr Pro 
555 560 565 

agt ggc cag gtc etc age ttc tgc att ctg cag, ctg ttt ccc ttc acc 1901 

Ser Gly Gin Val Leu Ser Phe Cys lie Leu Gin Leu Phe Pro Phe Thr 
570 575 ' 580 

tec gag age aag egg atg ggc gtc ate gtc agg gat gaa tec acg gca 1949 

Ser Glu Ser Lys Arg Met Gly Val lie Val Arg Asp Glu Ser Thr Ala 
585 590 595 

gaa ate aca ttc tac atg aag ggc get gac gtg gee atg tct cct ate 1997 

Glu lie Thr Phe Tyr Met Lys Gly Ala Asp Val Ala Met Ser Pro lie 
600 605 610 

gtg cag tat aat gac tgg ctg gaa gag gag tgc gga aac atg get cgc 2045 

Val Gin Tyr Asn Asp Trp Leu Glu Glu Glu Cys Gly Asn Met Ala Arg 

615 620 625 630 

gaa gga ctg egg acc etc gtg gtt.gca aag aag gcg ttg aca gag gag 2093 

Glu Gly Leu Arg Thr Leu Val Val Ala Lys Lys Ala Leu Thr Glu Glu 
635 640 645 

cag .tac cag gac ttt gag age cga tac act caa gee aag ctg age atg 2141 

Gin Tyr Gin Asp Phe Glu Ser Arg Tyr Thr Gin Ala Lys . Leu Ser Met 
650 655 660 
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cac gac agg tec etc aag gtg gec gcg gta gtc gag age ctg, gag agg . 2189 

His Asp Arg Ser Leu Lys Val Ala Ala Val Val Glu Ser Leu. Glu Arg 
665 670 675 

gag atg gaa ctg ctg tgc etc acc ggc gtg gag gac cag otg cag gca 2237 , 
Glu Met Glu Leu Leu Cys Leu Thr Gly Val Glu Asp Gin Leu Gin Ala 
680 685 690 

gac gtg egg ccc acg ctg gag atg ctg cgc aac gec ggg ate aag ata 2285 
Asp Val Arg Pro Thr Leu Glu Met Leu Arg Asn. Ala Gly He Lys He 
695 70Q ' ' 705 ; 710 

- i » - * 

tgg atg eta aca ggc gat aaa etc gag aca get acc tgc att gee aaa. 2333 
Trp Met Leu Thr Gly Asp Lys Leu Glu Thr Ala Thr Cys lie Ala Lys 
715 , 720 , 725 

»agt tea cat etc gtg tct aga aca caa gat att" cat att ttc aga cag 2381 
Ser Ser His Leu Val Ser Arg 'Thr Gin Asp He His lie Phe Arg Gin 
730 735 740 

gta acc agt egg gga gag gca cat ttg gag ctg aat gca ttt cga agg 2429 
Val Thr Ser Arg Gly Glu Ala His Leu Glu Leu Asn Ala Phe Arg Arg 
745 " 750 755 

aag cat gat tgt gca eta gtc ata tct. ggg gac tct ctg gag gtt tgt 2477 
Lys His Asp Cys Ala Leu Val lie Ser Gly Asp Ser Leu Glu Val Cys 
760 765 770 , 



eta aag tac tac gag cat gaa ttt gtg gag ctg gee tgc cag tgc cct 
Leu Lys Tyr Tyr Glu His Glu Phe Val Glu Leu Ala Cys Gin Cys Pro 
775 " 780 785 790 



2525 



gee gtg gtt tgc tgc cgc tgc tea ccc acc cag aag gee cgc att gtg 2573 
Ala Val Val Cys Cys Arg Cys Ser Pro Thr Gin Lys Ala Arg He Val 
795 800 805 

aca ctg ctg cag cag cac aca ggg aga cgc acc tgc gee ate ggt gat 2621 
Thr Leu Leu Gin Gin His Thr Gly Arg Arg Thr Cys Ala lie Gly Asp 
810 815 820 

gga gga aat gat gtc age atg att cag gca gca gac tgt ggg att ggg 2669 
Gly Gly Asn Asp Val Ser Met lie Gin Ala Ala Asp Cys Gly He Gly 
825 830 835 

att gag" gga aag gag ggt aaa cag gec teg ctg gcg gee gac ttc tec 2717 
He Glu Gly Lys Glu Gly Lys Gin Ala Ser Leu Ala Ala Asp Phe .Ser 
840 845 850 

ate acg cag ttc egg cac ata ggc agg ctg etc atg gtg cac ggg egg 2765 
'He Thr Gin Phe Arg His He Gly Arg Leu Leu Met Val His Gly Arg 
855 860 865 870 



aac age tac aag agg teg gcg gca etc ggc cag ttc gtc atg cac agg 

Asn Ser Tyr Lys Arg Ser Ala Ala Leu Gly Gin Phe Val Met His Arg 
.875 880 885 

ggc ctt ate ate tec acc atg cag get gtg ttt tec tea gtc ttc tac 

Gly Leu lie He Ser Thr Met Gin Ala Val Phe Ser Ser Val Phe Tyr 

890 8 95 900 
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ttc gca tec gtc cct ttg tat cag ggc ttc etc atg gtg ggg tat gec 
Phe Ala Ser Val Pro Leu Tyr Gin Gly Phe Leu> Met Val Gly Tyr Ala 
905 910 915 



. 2909 



acc ata tac acc atg ttc cca gtg ttc tec tta gtg ctg gac cag gac 
Thr lie Tyr Thr Met Phe Pro Val Phe Ser Leu Va,l Leu Asp .Gin Asp 
920 925 930 



2957 



gtg aag cca gag atg gcg atg etc tac ccg gag ctg tac aag gac etc 
Val Lys Pro Glu Met Ala Met Leu Tyr Pro Glu Leu Tyr Lys Asp Leu 
935 94q. ' 945 1 950 



3005 



acc aag gga aga tec ttg tec ttc aaa acc ttc etc ate tgg gtt tta 
Thr Lys' Gly Arg Ser Leu Ser Phe Lys Thr Phe Leu lie Trp Val Leu 
955 , 960 , 965 



.3053 



*ata agt att tac caa ggc ggc ate etc atg tat ggg gee ctg gtg etc 
lie Ser lie Tyr Gin Gly Gly lie Leu Met Tyr Gly Ala Leu Val Leu 
970 975 980 



3101 



1 ttc gag tct gag ttc gtc cac gtg gtg gec ate tec ttc acc gca ctg 
Phe Glu Ser Glu Phe Val His Val Val Ala He Ser Phe Thr Ala Leu 
985 990 995 



3149 



ate ctg acc gag ctg ctg atg gtg gcg ctg acc gtc cgc acg tgg cac 
He Leu Thr Glu Leu Leu Met Val Ala Leu Thr Val Arg Thr Trp His 
1000 1005 1010 i. 



3197 



tgg ctg atg gtg gtg gee gag ttq etc age tta ggc tgc tac gtg tec 
Trp Leu Met Val Val Ala Glu Phe Leu Ser Leu Gly Cys Tyr Val Ser 
1015 1020 1025 ~ 1030 



3245 



tea etc get ttt etc aat gaa tat ttt gat gtt gee ttt ate acc acc 
Ser Leu Ala Phe Leu Asn Glu Tyr Phe Asp Val Ala Phe He Thr Thr 
1035 1040 1045 



3293 



gtg acc ttc ctg tgg aaa gtg teg gcg ate acc gtg gtc age tgc etc 
Val Thr Phe Leu Trp Lys Val Ser Ala He Thr Val Val Ser Cys Leu 
1050 1055 1060 



3341 



.ccg ctg tat gtc etc aag tac ctg agg cgc aag tqt tct cct ccc age 
, Pro Leu Tyr Val Leu Lys Tyr Leu Arg Arg Lys Ser Ser Pro Pro Ser 
1065 1070 1075 



3389 



tac tgc aag ctg gee tec taa ggggctgtgc acccccagcg ggctggcccc 
Tyr Cys Lys Leu Ala Ser * 
1080 



3440 



agcaccttct 
gctaccaagc 
ctgagacagc 
cagatgecag 
gggcatagat 
agcccagggc 
aggcaagccc 
cctgccaggc 
ctcacccctg 
gcttgagttt 
ttactctggc 
gattctttct 
ctccacggag 



gcccttccca 
aagcaccaca 
ctctccttct 
gatggcttct 
gctgagacag 
acagaggecg 
agggcacaga 
aagcccaggg 
cttttctttc 
ctactgaaaa 
attctgagaa 
ttgatgttat 
aaaaaaaaaa 



gcaccttgtg 
agaaagggag 
cagtgcaggg 
ccctctcagt 
cctctccctc 
ggacggcctc 
tgecgggatg 
cacagatget 
tttttttgta 
tgagcccgaa 
ttagactgaa 
aacacaaagt 
aaaaaaaaaa 



cccttgccag 
ggtacgccag 
acgtcacccc 
gegaggctte 
teagtgeagg 
tccctctcag 
gcccctccct 
gcgatggcct 
ttgtcaaaat 
ttatttcact 
agtttaattt 
cattcctact 
aaaaaaaa 



tgaaegcagg 
gcgagcccag 
tgccaggcaa 
acccctgcca 
gaegtcaccc 
tgtgaggctt 
etcagtgegg 
cttcctctta 
tgtatttcca 
attactgtaa 
ctgcagttcc 
caaatgtaat 



gtttgecatt 
ggcacagatg 
gcccagggca 
ggcaagccca 
ctgccaggca 
cacccatgct 
gaacgtcacc 
agtgtggggc 
tattgaagca 
agggttcatc 
ctcatattca 
aaaattgagg 



3500 
3560 
3620 
3680 
3740 
3800 
3860 
3920 
3980 
4040 
4100 
4160 
4198 
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<210> 23 M . n .. § . ■ ' • . ' 

<211> 1084 

<212> PRT , , . . , r 

<213> Homo sapiens ; ( 

<400> 23 

Met Pro Leu Met Met .Ser Glu Glu Gly Phe Glu Asn Glu Glu Ser Asp 

1 5 . 10 15 

Tyr His Thr Leu Pro Arg Ala Arg lie Met Gin Arg, Lys Arg Gly Leu 

20 25 30 

Glu Trp Phe Val Cys Asp G^y Trp Lys Phe Leu 'Cys Thr Ser Cys Cys 

35 ' • 40 45 

Gly Trp Leu He 'Asn lie Cys Arg Arg Lys Lys Glu Leu Lys Ala Arg 

50 , . 55 60 • 

Thr Val Trp Leu Gly, Cys Pro iGlu Lys Cys Glu Glu Lys His Pro Arg 
65 70 75 80 

Asn Ser I,le Lys Asn Gin Lys Tyr Asn Val Phe Thr Phe He Pro Gly 

* 85 "90 • 95 

Val Leu Tyr Glu Gin Phe Lys Phe Phe Leu Asn Leu Tyr Phe Leu Val 

* 100 105 . ' HO 

lie Ser Cys Ser Gin Phe Val Pro Ala Leu Lys lie Gly Tyr Leu Tyr 

115 120 125 

Thr Tyr Trp Ala Pro Leu Gly Phe Val Leu Ala Val Thr Met Thr Arg 
130 135 ' 140 1 

Glu Ala lie Asp Glu • Phe Arg Arg Phe Gin Arg Asp Lys Glu Val Asn 
145 150 . 155 160 

Ser Gin Leu Tyr Ser Lys Leu Thr Val Arg Gly Lys Val Gin Val Lys 

165 170 175 

Ser Ser Asp He Gin Val Gly Asp Leu He lie Val Glu Lys Asn Gin 

180 185 190 

Arg He Pro Ser Asp Met Val Phe Leu Arg Thr Ser Glu Lys Ala Gly 

195 200 ' 205 

Ser Cys Phe lie Arg Thr Asp Gin Leu Asp Gly Glu Thr Asp Trp Lys 

210 215 220 

Leu Lys Val Ala Val Ser Cys Thr Gin Gin Leu Pro Ala Leu Gly Asp 
225 230 235 240 

Leu Phe Ser lie Ser Ala Tyr Val Tyr Ala Gin Lys, Pro Gin Met Asp 

245 250 * 255 

He His Ser Phe Glu Gly Thr Phe* Thr Arg Glu Asp Ser Asp Pro Pro 

260 265 270 

He His Glu Ser Leu Ser lie Glu Asn Thr Leu Trp Ala Ser Thr lie 

275 280 285 

Val Ala Ser Gly Thr Val He Gly Val Val He. Tyr Thr Gly Lys Glu 

290 295 ' 300 

Thr Arg Ser Val Met Asn Thr Ser Asn Pro Lys Asn Lys Val Gly Leu 
305 310 315 320 

Leu Asp Leu Glu Leu Asn Arg Leu Thr Lys Ala Leu Phe Leu Ala Leu 

325 330 335 

Val Ala Leu Ser He Val Met Val Thr Leu Gin Gly Phe Val Gly Pro 

340 345 350 

Trp Tyr Arg Asn Leu Phe Arg Phe Leu Leu Leu Phe . Ser Tyr He lie 

355 360 365 

Pro He Ser Leu Arg Val Asn Leu Asp Met Gly Lys Ala Val Tyr Gly 

370 375 380 

Trp Met Met Met Lys Asp Glu Asn He Pro Gly Thr Val Val Arg Thr 
385 390 395 400 

Ser Thr He Pro Glu Glu Leu Gly Arg Leu Val Tyr Leu Leu Thr. Asp 

405 « 410 415 

Lys Thr Gly Thr Leu Thr Gin Asn Glu Met He Phe Lys Arg Leu His 
420 425 430 
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Leu Gly Thr Val Ser Tyr Gly Ala Asp. Thr Met Asp Glu lie Gin Ser 

435 440 • ■ . 4 45 

His Val Arg Asp Ser Tyr iSer Gin Mel: Gin Ser Gin .Ala Gly Gly Asn 
T5~0 ~ '■ 4~5*5 ; : 7~ , , 4bu j " : : ~ 

Asn Thr Gly Ser Thr Pro Leu Arg Lys Ala Gin Ser Ser Ala Pro Lys 
465 ! 470 - .475 , . 480 

Val Arg Lys Ser Val Ser Ser Arg lie His Glu Ala Val Lys Ala lie 

485 4 90 . ' - 4 95 

Val Leu Cys His Asn Val Thr Pro Val Tyr Glu Ser Arg Ala Gly Val 

500 505 510 

Thr Glu Glu Thr Glu Phe Ala Glu Ala Asp Gin Asp Phe Ser Asp Glu 

515 . 520 , 525 

Asn Arg Thr Tyr Gin Ala Ser Ser Pro Asp Glu Val Ala Leu Val Gin 

530 535 540 

Trp Thr Glu Ser Val Gly Leu Thr Leu Val Ser Arg Asp I,eu Thr Ser 
545 550 . 555'. , 560- 

'Met Gin Leu Lys Thr Pro Ser Gly Gin Val Leu Ser Phe. Cys lie Leu 
' 565 ' 570 575 . 

Gin Leu Phe Pro Phe Thr Ser Glu Ser Lys Arg Met Gly Val He Val 

580 585 590 

Arg Asp Glu Ser Thr Ala Glu He Thr Phe Tyr Met Lys Gly Ala Asp 

595 600 605 

Val Ala Met Ser Pro He Val Gin Tyr Asn Asp Trp Leu Glu Glu. Glu 

610 615 ■ 620 

Cys Gly Asn Met Ala Arg Glu Gly Leu Arg Thr Leu Val Val Ala Lys 
625 630 635 640 

Lys Ala Leu Thr Glu Glu Gin Tyr Gin Asp Phe Glu Ser Arg Tyr Thr 

645 650 655 

Gin Ala Lys Leu Ser Met His Asp Arg Ser Leu Lys Val Ala Ala Val 

660 665 670 

Val Glu Ser Leu Glu Arg Glu Met Glu Leu Leu Cys Leu Thr Gly Val 

675 680 685 

Glu Asp Gin Leu Gin Ala Asp Val Arg Pro Thr Leu Glu Met Leu Aig 

690 695 700 

Asn Ala Gly He Lys He Trp Met Leu Thr Gly Asp Lys Leu Glu Thr 
705 710 715 720 

Ala Thr Cys He Ala Lys Ser Ser His Leu Val Ser Arg Thr Gin Asp 

725 730 735 

lie His He Phe Arg Gin Val Thr Ser Arg Gly Glu Ala His Leu Glu 

740 745 -750 * 

Leu Asn Ala Phe Arg Arg Lys His Asp Cys Ala Leu Val He Ser Gly 

755 760 765 

Asp Ser Leu Glu Val Cys Leu Lys Tyr Tyr Glu His Glu Phe Val Glu 

770 775 . 780 

Leu Ala Cys Gin Cys Pro Ala Val Val Cys Cys Arg Cys Ser Pro Thr 
785 790 795 -800 

Gin Lys Ala Arg He Val Thr Leu Leu Gin Gin His Thr Gly Arg Arg 

805 810 815 

Thr Cys Ala He Gly Asp Gly Gly Asn Asp Val Ser Met He Gin Ala 

820 825 830 

Ala Asp Cys Gly He Gly He Glu Gly Lys Glu Gly Lys Gin Ala Ser 

835 840 845 

Leu Ala Ala Asp Phe Ser lie Thr Gin Phe Arg His lie Gly Arg. Leu 

850 855 860 

Leu Met Val His Gly Arg Asn Ser Tyr Lys Arg Ser Ala Ala Leu Gly 
865 870 875 880 

Gin Phe Val Met His Arg Gly Leu He He Ser Thr Met Gin Ala Val 

885 890 895 

Phe Ser Ser Val Phe Tyr Phe Ala Ser Val Pro Leu Tyr Gin Gly Phe 
900 905 910 
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Leu Met Val Gly Tyr Ala Thr lie Tyr Thr Met Phe. Pro Val Phe Ser 

915 - ,,920 , - 925 , 

Leu Val Leu Asp Gin* ksp Val Lys Pro Glu Met Ala Met Leu Tyr -Pro 





930 










935 




94U 




Thr 


Glu 


Leu 


Tyr 


Lys 


Asp 


Leu 


Thr 


Lys Gly. Arg Ser Leu Ser Phe 


Lys 1 


945 




i 






950 






955 • 




960 


Phe 


Leu 


lie 


Trp 


Val 


Leu 


He 


Ser 


He Tyr Gin Gly • Gly He. 


Leu 


Met 










965 








970 


975 




Tyr 


Gly 


Ala 


Leu 


Val 


Leu 


Phe 


Glu 


Ser Glu Phe Val His Val 


Val 


Ala 




980 










985 " 990 






He 


Ser 


Phe 


Thr 


Ala 


Leu 


He 


Leu 


Thr Glu Leu Leu Met Val 


Ala 


Leu 






995 








1000 i . 1005 






Thr 


Val 


Arg 


Thr 


Trp 


His 


Trp 


Leu 


Met Val. Val Ala Glu Phe 


Leu 


Ser 




1010 








1015 


1020 






Leu 


Gly Cys 


Tyr Val Ser 


Ser 


Leu 


Ala Phe Leu Asn Glu Tyr 


Phe 


Asp 


1025 








1030 




1035 




1040 


Val 


Ala 


Phe 


He 


Thr 


Thr 


Val 


Thr 


Phe Leu Trp Lys Val Ser 


Ala 


lie 










1045 






1050 


.1055 


Thr 


Val 


Val 


Ser 


Cys 


Leu 


Pro 


Leu 


Tyr- Val Leu Lys Tyr Leu 


Arg Arg 








1060 








1065 . 1070 




Lys 


Ser 


. Ser 


Pro 


Pro 


Ser 


Tyr 


Cys 


Lys Leu Ala Ser ' \ 







1075 1080 



<210> 24 

<211> 3255 ' . i 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . . (3255) 

<400> 24 ' 

atg cca eta atg atg tct gaa ,gaa ggc ttt gag aat gag gaa agt gat 4 8 
Met Pro Leu Met Met Ser Glu Glu Gly Phe Glu Asn Glu Glu Ser Asp 
1 5 10 15 

tac cac acc tta cca cga gec agg ata atg caa agg aaa aga gga ctg 96 
Tyr His Thr Leu Pro Arg Ala Arg He Met Gin Arg Lys Arg Gly Leu 
20 • 25 • 30 

gag tgg ttt gtc tgt gat ggc tgg aag ttc etc tgt acc agt tgc'tgt 144 
Glu Trp Phe Val Cys Asp Gly Trp Lys Phe Leu Cys Thr Ser Cys Cys 
35 40 45 

ggt tgg ctg ata aat at t tgt cga aga aag aaa gag ctg aaa get cgc 192 
Gly Trp Leu He Asn He Cys Arg Arg Lys Lys Glu Leu Lys Ala Arg 
50 55 60 

aca gta tgg ctt gga tgt cct gaa aag tgt gaa gaa aaa cat ccc agg 24 0 
Thr Val Trp Leu Gly Cys Pro Glu Lys Cys Glu Glu Lys His Pro Arg 
65 70 75 80 

aat tct ata aaa aat caa aaa tac aat gtg ttt acc ttt ata cct ggg 288 
Asn Ser He Lys Asn Gin Lys Tyr Asn Val Phe Thr Phe He Pro Gly 
85 90 95 

gtt ttg tat gaa .caa ttc aag ttt ttc ttg aat etc tat ttt eta gtg 336 
Val Leu Tyr Glu Gin Phe Lys Phe Phe Leu Asn Leu Tyr Phe Leu Val 
100 . 105 110 
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ata tec tgc tea cag ttt gta cca gca ttg aaa ata ggc tat etc tac 384. 
lie Ser Cys Ser Gin t phe »Val-,.Pro Ala Leu Lys lie Gly Tyr Leu Tyr. 
115 120 125 

acc tac tgg get cct ctg gga ttt gtc ttg get gtt act atg aca' egg 432 
Thr Tyr Trp» Ala Pro Leu Gly Phe Val *Leu Ala Val Thr Met Thr Arg . 
130 135 140" ' 

gaa gca att gat gaa ttt egg cgt ttt cag cgt gac aag gaa gtg aat 480 

Glu Ala lie Asp Glu Phe Arg Arg Phe Gin Arg Asp, Lys Glu Val Asn 
145 150 155, 160 ' 

. 1 ' .... 

tea caa eta tat age aag ctt aca gta aga ggt aaa gtg caa gtt aag 528 

Ser Gin Leu Tyr » Ser Lys Leu Thr Val Arg Gly Lys Val Gin Val Lys 
, 165 , 170 . - 175 

agt tea gac ata caa gtt gga gac etc ate ata gtg gaa aag aat caa 57 6 
Ser Ser ^sp lie Gin Val Gly Asp Leu lie lie Val Glu Lys Asn Gin 
180 185 ' 190 

aga att cca teg gac atg gtg ttt ctt agg act tea gaa aala gca ggt 624 
Arg lie Pro Ser Asp Met Val Phe. Leu Arg Thr Ser Glu Lys Ala Gly 
195 f 200 205 

teg tgt ttt att cga act gat caa eta gat ggt gaa" act gac tgg aa^ 672 
Ser Cys Phe lie Arg • Thr Asp Gin Leu Asp Gly Glu Thr Asp Trp Lys 
210 215 , 220 

ctg aag gtg gca gtg age tgc acg caa cag ctg ccg get ctg ggg gac 720 
Leu Lys Val Ala Val Ser Cys Thr Gin Gin Leu Pro Ala Leu Gly Asp 
225 230 235 240 

ctt ttt tct ate agt get tat gtt tat get cag aaa cca caa atg gac 7 68 
Leu Phe Ser He Ser Ala Tyr Val Tyr Ala Gin Lys Pro Gin Met Asp 
245 250 255 

att cac agt ttc gaa ggc aca ttt acc agg gaa gac agt gac ccg ccc 816 
He His Ser Phe Glu Gly Thr Phe Thr Arg Glu Asp, Ser Asp Pro Pro 
260 265 ' 270 

att cat gaa agt etc age ata gaa aat aca ttg tgg gca age acc att 864 
He His Glu Ser Leu Ser He Glu Asn Thr Leu Trp Ala Ser Thr lie 
275 280 285 

gtt gca tea ggt act gta ata ggt gtt gtc att tat acc gga aaa gag 912 
Val Ala Ser Gly Thr Val lie Gly Val Val He Tyr Thr Gly Lys Glu 
290 295 300 

act cga agt gta atg aac aca tec aat cca aaa aat aag gtt ggt ttg 960 
Thr Arg Ser Val Met Asn Thr Ser Asn Pro Lys Asn Lys Val Gly Leu 
305 310 315 320 

ttg gac ctt gaa etc aat egg ctg acg aaa gcg eta ttt ttg get tta 1008 
Leu Asp Leu Glu Leu Asn Arg Leu Thr Lys Ala Leu Phe Leu Ala Leu 
325 330 335 

gtt get ctt tec att gtt atg gta acc tta caa gga ttt gtg ggt cca 1056 
Val Ala Leu Ser He Val Met Val Thr Leu Gin Gly Phe Val Gly Pro 
340 345 350 
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tgg tac cgc aat ctt . ttt egg ttc ctt etc etc ttt tct tac- ate att . 1104 
Trp Tyr Arg Asn -Leu Phe Arg Phe Leu Leu Leu Phe Ser Tyr lie' lie 
355 - 360 - 365 - ■ 

• • i ' , • . ' 

ccc ata, agt ttg cgt gtg aae ttg gac atg ggc aaa gcg gtg tat gga 1152 
Pro lie Ser Leu Arg Val Asn Leu Asp Met Gly Lys Ala Val . Tyr Gly 
370 375 3,80 

tgg atg atg atg aaa gat gag aae ate cct ggc acg gte gtt egg ace 1200 

Trp Met Met Met Lys Asp Glii Asn lie Pro Gly Thr Val Val Arg Thr . 

385 . , 390,- ' 395 ' 400 

i 1 ■ 

age act ate cca gag gaa ctt ggg cgc ctg gtg tat tta ttg aca gac . 1248 

Ser Thr lie Pro Glu Glu Leu Gly Arg Leu Val Tyr Leu Leu Thr Asp 

405 » 410 415 

'aa^ aca gga ace etc acc cag aat gaa atg ata ttt aag egg ctg cac 1296 
Lys Thr Gly Thr Leu Thr Gin Asn Glu Met lie Phe Lys Arg Leu His 
420 425 4 30 

ctg ggc acc gtg tec tat ggc gee gac acg atg gat gag ate cag age 134 4 
Leu Gly Thr Val Ser Tyr Gly Ala Asp Thr Met Asp Glu lie Gin Ser 
435 440 445 

cat gtc agg gac tec tac tea cag atg cag tct caa get ggt gga aac 1392. 
His Val Arg Asp Ser Tyr Ser Gin Met Gin Ser Glh Ala Gly Gly Asn 
450 455 460 • 

aat act ggt tea act cca eta aga aaa gec caa tct tea get ccc aaa 1440 
Asn Thr Gly Ser Thr Pro Leu Arg Lys Ala Gin Ser Ser Ala Pro Lys 
465 470 475 480 

gtt agg aaa agt gtc agt agt cga ate cat gaa gee gtg aaa gee ate 14 88 
Val Arg Lys Ser Val Ser Ser Arg lie His Glu Ala Val Lys Ala lie 
4 85 4 90 4 95 

gtg ctg tgt cac aac gtg acc ccc gtg tat gag tct egg gee ggc gtt 1536 
Val Leu Cys His Asn Val Thr Pro Val Tyr Glu Ser Arg Ala Gly Val 
500 505 510 

act gag gag act gag ttc gca gag get gac caa gac ttc agt gat gag 1584 
.Thr Glu Glu Thr Glu Phe Ala Glu Ala Asp Gin Asp Phe Ser Asp Glu 
515 520 . 525 

aat cgc acc tac cag get tec age ceg gat gag gtc' get ctg gtg cag 1632 
Asn Arg Thr Tyr Gin Ala Ser Ser Pro Asp Glu Val Ala Leu Val Gin 
530 - 535 540 

tgg aca gag agt gtg ggc etc acg ctg gtc age agg gac etc acc tec 1680 
Trp Thr Glu Ser Val Gly Leu Thr Leu Val Ser Arg Asp Leu Thr Ser 

545 550 555 560 

i 

atg cag ctg aag acc ccc agt ggc cag gtc etc age ttc tgc att ctg 1728 
Met Gin Leu Lys Thr Pro Ser Gly Gin Val Leu Ser Phe Cys lie Leu 
565 570 575 

cag ctg ttt ccc ttc acc tec gag age aag egg atg ggc gtc ate gtc 1776 
Gin Leu Phe Pro Phe Thr Ser Glu Ser Lys Arg Met Gly Val He Val 
580 585 590 
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agg gat gaa tec acg gca gaa ate aca ttc tap atg aag ggc get gac . 1824 
Arg Asp Glu Ser Thr Ala Glu He- Thr Phe Tyr» Met Lys Gly Ala Asp 
595 600- . 605 

gtg gee atg tct cct ate gtg cag tat aatgac tgg ctg gaa gag gag 1872 . 
Val Ala Met Ser Pro lie Val Gin Tyr Asn Asp Trp Leu Glu .Glu Glu ' 
610 615 620 

tgc gga aac atg get cgc gaa gga ctg egg ace etc gtg gtt gca aag 1920 
Cys Gly Asn Met Ala Arg Glu Gly Leu Arg Thr. Leu Val Val Ala Lys 
625 630 '635 : 640 

• ' . . ■ i ■ . 

aag gcg ttg aca gag gag cag tac cag gac ttt gag age cga tac act .1968 
Lys Ala Leu Thr Glu Glu Gin Tyr Gin Asp Phe Glu Ser Arg Tyr Thr 
645 , 650 , 655 

taa gec aag ctg age atg cac ( gac agg tec etc aag gtg gee gcg gta 2016 
Gin Ala Lys Leu Ser Met His Asp Arg Ser Leu Lys Val Ala Ala Val 
660 . 665 670 

gtc gag age ctg gag agg gag atg gaa ctg ctg tgc etc ace ggc gtg 2064 
Val Glu Ser Leu Glu Arg Glu Met Glu Leu Leu Cys Leu Thr Gly Val 
675 680 685 

gag gac cag ctg cag gca gac gtg egg ccc acg ctg gag atg ctg cgc 2112 
Glu Asp Gin Leu Gin Ala Asp Val Arg Pro Thr Leu Glu Met Leu Arg 
690 695 700 i. 

aac gec ggg ate aag ata tgg atg eta aca ggc gat aaa etc gag aca 2160 
Asn Ala Gly He Lys lie Trp Met Leu Thr Gly Asp Lys Leu Glu Thr 
705 710 715 720 

get acc tgc att gec aaa agt tea cat etc gtg tct aga aca caa gat 2208 
Ala Thr Cys lie Ala Lys Ser Ser His Leu Val Ser Arg Thr Gin Asp 
725 730 735 

att cat att ttc aga cag gta acc agt egg gga gag gca cat ttg gag 2256 
He His He Phe Arg Gin Val Thr Ser Arg Gly Glu Ala His Leu Glu 
740 745 750 

ctg aat gca ttt cga agg aag cat gat tgt gca eta gtc ata tct ggg 2304 
,Leu Asn Ala Phe Arg Arg Lys His Asp Cys Ala Leu Val lie Ser Gly 
755 760 765 

gac tct 'ctg gag gtt tgt eta aag tac tac gag cat gaa ttt gtg gag 2352 
Asp Ser Leu Glu Val Cys Leu Lys Tyr Tyr Glu His Glu Phe Val Glu 
770 775 780 

ctg gee tgc cag tgc cct gec gtg gtt tgc tgc cgc tgc tea ccc acc 2400 
Leu Ala Cys Gin Cys Pro Ala Val Val Cys Cys Arg Cys Ser Pro Thr 
785 790 ■" 795 800 

cag aag gec cgc att gtg aca ctg ctg cag cag cac aca ggg aga cgc 2448 
Gin Lys Ala Arg He Val Thr Leu Leu Gin Gin His Thr Gly Arg Arg 
805 810 815 

acc tgc gee ate ggt gat gga gga aat gat gtc age atg att cag gca 2496 
Thr Cys Ala He Gly Asp Gly Gly Asn Asp Val Ser Met He Gin Ala 
820 825 830 
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gca. gac tgt ggg a'tt ggg att gag gga aag gag ggt aaa^ cag gcc teg 

Ala- Asp Cys Gly He, ply *Zle„Giu Gly Lys Glu Gly Lys Glsn Ala Ser. 
... ... 835 4 840 845 

• * i » f 

ctg gcg gcc gac ttc tec ate acg cag ttc egg cae ata ggc agg ctg 

Leu Ala Ala* Asp Phe Ser lie Thr Gin 'Phe Arg His lie Gly Arg Leu 

850 855 ; 860".' 



2544 



2592 



etc atg gtg 
Leu Met Val 
865 

cag ttc gtc 
Gin Phe Val 



ttt tec tea 
Phe Ser S^er 



cac ggg egg aac age tac aag agg teg gcg gca etc ggc 

His Gly Arg Asn Ser tyr Lys Arg Ser, Ala Ala Leu Gly 

870 875 880 

» • 

i 

atg cac agg ggc ctt ate ate tec ace atg cag get gtg 

Met -His Arg Gly Leu He He Ser Thr Met Gin Ala Val 
885 , 890 • . 895 

i 1 

gtc ttc tac ttc gca tec gtc cct t.tg tat cag ggc ttc 

Val Phe Tyr Phe Ala Ser Val Pro Leu Tyr Gin Gly Phe. 
900 905' 910 



2640 



2688 



2736 



etc atg gtg ggg tat gcc acc ata tap acc atg ttc cca gt)g ttc tec 2784 
Leu Met Val Gly Tyr Ala Thr lie Tyr Thr Met Phe Pro Val Phe Ser 
915 ■ • ( ?20 925 

tta gtg ctg gac cag gac gtg aag cca gag atg gcg atg etc tap .ccg 
Leu Val Leu Asp Gin* Asp Val Lys Pro Glu Met Ala Met Leu Tyr Pro 
930 935 . 940 

gag ctg tac aag gac etc acc aag gga aga tec ttg tec ttc aaa acc 
Glu Leu Tyr Lys Asp Leu Thr Lys Gly Arg Ser Leu Ser Phe Lys Thr 
945 950 955 960 

ttc etc ate tgg gtt tta ata agt att tac caa ggc' ggc ate etc atg 
Phe Leu He Trp Val Leu He Ser lie Tyr Gin Gly Gly He Leu Met 
965 970 975 

tat ggg gcc ctg gtg etc ttc gag tct gag ttc gtc cac gtg gtg gcc 
Tyr Gly Ala Leu Val Leu Phe Glu Ser Glu Phe Val, His Val Val Ala 
980 985 ' 990 

ate tec ttc acc gca ctg ate ctg acc gag ctg ct ; g atg gtg gcg ctg 
He Ser Phe Thr Ala Leu He Leu Thr Glu Leu Leu Met Val Ala Leu 
995 1000 1005 

acc gtc cgc acg tgg cac tgg ctg. atg gtg gtg gcc gag ttc etc age 
Thr Val Arg Thr Trp His Trp Leu Met Val Val Ala Glu Phe Leu Ser 
1010 1015 1020 

tta ggc tgc tac gtg tec tea etc get ttt etc aat gaa tat ttt gat 
Leu Gly Cys Tyr Val Ser Ser Leu Ala Phe Leu Asn Glu Tyr Phe Asp 
1025 1030 1035 1040 

gtt gcc ttt ate acc acc gtg acc ttc ctg tgg aaa gtg teg. gcg ate 3168 
Val Ala Phe He Thr Thr Val Thr Phe Leu Trp Lys Val Ser Ala He 
1045 1050 1055 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



acc gtg gtc age tgc etc ccg ctg tat gtc etc aag tac ctg agg cgc 
Thr Val Val Ser Cys. Leu Pro Leu Tyr Val Leu Lys Tyr Leu Arg Arg 
1060 1065 1070 



3216 
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aag tct tct cct ccc age tac tgc aag ctg gec tec taa 3255 
Lys Ser Ser Pro Pro, ger ; *Tyr r.Cys Lys Leu Ala Ser * • , 
1075 ■ ~ 1080 



<210> 25 V 
<211> 4231 
<212> DNA 

<213> Homo sapiens 

i 

<220> 

<221> CDS . ■ ' . i ' 1 . 

<222> (156) ... (3443) 

<400> 25 i 

ggagtcgacc cacgcgtccg cattgatjaca atgcctccac aaatacttga tgeaaaatte 60 
agtaagacag cacttgitga atcaccatta tagtttctga caaattgttc tcaaaaaggt 120 
accagctgg£ ggatgagtct . gcgcatttgg atgaa atg cca eta atg atg tct 173 

Met Pro Leu Met Met Ser 
.1 1 5 
■ . 1 

gaa gaa ggc ttt gag aat gag gaa agt ga't tac cac ace tta cca cga 221 
Glu Glu Gly Phe Glu Asn Glu, Giu Ser Asp Tyr His Thr Leu Pro Arg 
10 1 15 ,20 

gec agg ata atg caa'agg aaa aga gga ctg gag tgg ttt gtc tgt gat 269 
Ala Arg lie Met Gin Arg Lys Arg Gly Leu Glu Trp Phe Val Cys Asp 
25 30 35 

ggc tgg aag ttc etc tgt acc agt tgc tgt ggt tgg ctg ata aat att 317 
Gly Trp Lys Phe Leu Cys Thr Ser Cys Cys Gly Trp Leu lie Asn lie 
40 45 , ' 50, 

tgt cga aga aag aaa gag ctg aaa get cgc aca gta tgg ctt gga tgt 3 65 
Cys Arg Arg Lys Lys Glu Leu Lys Ala Arg Thr Val Trp Leu Gly Cys 
55 60 65 70 

cct gaa aag tgt gaa gaa aaa cat ccc agg aat tct, ata aaa aat caa 413 
Pro Glu Lys Cys Glu Glu Lys His Pro Arg Asn Ser lie Lys Asn Gin 
75 80 85 

aaa tac aat gtg ttt acc ttt ata cct ggg gtt ttg tat gaa caa ttc 4 61 
Lys Tyr Asn Val Phe Thr Phe lie Pro Gly Val Leu Tyr Glu Gin Phe 
90 95 100 

aag ttt ttc ttg aat etc tat ttt eta gtg ata tec tgc tea cag ttt 509 
Lys Phe Phe Leu Asn Leu Tyr Phe Leu Val lie Ser Cys Ser Gin Phe 
105 110 115 

gta cca gca ttg aaa ata ggc tat etc tac acc tac tgg get cct ctg 557 
Val Pro Ala Leu Lys lie Gly Tyr Leu Tyr Thr Tyr Trp Ala Pro Leu 
120 125 130 

gga ttt gtc ttg get gtt act atg aca egg gaa gca att gat gaa ttt 605 
Gly Phe Val Leu Ala Val Thr Met Thr Arg Glu Ala lie Asp Glu Phe 
135 140 145 . 150 

egg cgt ttt cag cgt. gac aag gaa gtg aat tea caa eta tat age aag 653 
Arg Arg Phe Gin Arg Asp Lys Glu Val Asn Ser Gin Leu Tyr Ser Lys 
155 160 . 165 ' 
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ctt aca gta aga ggt aaa gtg caa gtt aag agt tea gac ata caa gtt 701 
Leu Thr Val Arg Gly liys Val Gin Val Lys Ser Ser Asp lie Gin Val 
170 175 . . '.' * 180 * 

gga gac etc ate ata gtg gaa aag'aat caa acja att caa t;cg gac atg 74 9 
Gly Asp Leu lie lie Val Glu Lys Asn Gin Arg lie Pro Ser Asp Met 
185 190 ' 195 

gtg ttt ctt agg act tea gaa aaa gca ggt teg tgt ttt att cga act 797 
Val Phe. Leu Arg Thr Ser Glu Lys Ala Gly Ser Cys Phe lie Arg Thr 
200 , 205 ' ■ • 210 ; 

gat caa eta gat ggt gaa act gac . tgg aag c*tg aag gtg gca gtg age 845. 
Asp Gin Leu Asp Gly Glu Thr Asp Trp Lys Leu Lys Val Ala Val Ser 
215 220 225 230 

,tgc acg caa cag ctg ccg get ctg ggg gac ctt ttt tct ate agt get 893 . 
Cys Thr Gin Gin Leu Pro Ala 'Leu Gly Asp Leu Phe Ser lie Ser Ala 
* 235 240. 245 

tat gtt tat get cag aaa cca caa atg gac att cac agt ttc gaa ggc 941 
Tyr Val Tyr Ala Gin Lys Pro Gin Met Asp lie His Ser Phe Glu Gly 
250 255 . 260 

aca ttt ace agg gaa gac agt gac ccg ccc att cat gaa agt etc age 989 
Thr Phe Thr Arg Glu Asp Ser Asp Pro Pro lie His Glu Ser Leu Ser 
265 270 275 

r 

ata gaa aat aca ttg tgg gca age acc att gtt gca tea ggt act gta 1037 
lie Glu Asn Thr Leu Trp Ala Sei: Thr lie Val Ala Ser Gly Thr Val 
280 285 290 

ata ggt gtt gtc att tat acc gga aaa gag act cga agt gta atg aac 1085 
lie Gly Val Val lie Tyr Thr Gly Lys Glu Thr Arg Ser Val Met Asn 
295 300 305 310 

aca tec aat cca aaa aat aag gtt ggt ttg ttg gac ctt gaa etc aat 1133 
Thr Ser Asn Pro Lys Asn Lys Val Gly Leu Leu Asp Leu Glu Leu Asn 
315 320 325 

egg ctg acg aaa gcg eta ttt ttg get tta gtt get ctt tec att gtt 1181 
Arg Leu Thr Lys Ala Leu Phe Leu Ala Leu Val Ala Leu Ser lie Val 
330 335 340 

atg gta-acc tta caa gga ttt gtg ggt cca tgg tac cgc aat ctt ttt 1229 
Met Val Thr Leu Gin Gly Phe Val Gly Pro Trp Tyr Arg Asn Leu Phe 
345 350 355 

egg ttc ctt etc etc ttt tct tac ate att ccc ata agt ttg cgt gtg 1277 
Arg Phe Leu Leu Leu Phe Ser Tyr lie lie Pro lie Ser Leu Arg Val 
360 365 370 

• aac ttg gac atg ggc aaa gcg gtg tat gga tgg atg atg atg aaa gat 1325 
Asn Leu Asp Met Gly Lys Ala Val Tyr Gly Trp Met Met Met Lys Asp 
375 380 385 390 

gag aac ate cct ggc acg gtc gtt egg acc age act ate cca gag gaa 1373 
Glu Asn lie Pro Gly Thr Val Val Arg Thr Ser Thr lie Pro Glu Glu 
395 400 405 
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ctt ggg cgc ctg gtg tat tta ttg aca gac aaa aca gga acc etc acc 1421 

Leu Gly Arg Leu : -Val , Jyr £eu nLeu Thr Asp Lys Thr Gly Thar Leu .Tfrr » 
. 410 415 . 420 

• i ■. , r 

cag aat gaa atg ata ttt aag egg ctg ;cac ctg ggc acc gtg tec tat 14 69 

Gin Asn Glu' Met lie Phe Lys Arg Leu His Leu Gly Thr Val Ser Tyr 
425 430 '4'35 

ggc gee gac acg atg gat gag ate cag age cat gtc agg gac tec tac 1517 

Gly Ala Asp Thr Met Asp Glu lie Gin Ser His Val , Arg Asp Ser Tyr 

440 " , 445 , 450 

tea cag atg cag tct caa get ggt gga aac aat act ggt tea act cca 1565 ' 

Ser Gin Met Gin 'Ser Gin Ala Gly Gly Asn Asn Thr Gly Ser Thr Pro 

455 ' , ,460 465 470 



eta aga aaa gee caa tct tea get ccc aaa gtt agg aaa agt gtc agt 
Leu Arg I^ys Ala Gin Ser Ser Ala Pro Lys Val Arg Lys Ser Val Ser 
475 480 485 



1613 



agt cga ate cat gaa gee gtg aaa gee ate gtg ctg tgt ca'c aac gtg 1661 
Ser Arg lie His Glu Ala Val Lys Ala lie Val Leu Cys His Asn Val 
4 90 ( , ' 4 95 500 

acc ccc gtg tat gag tct egg gee ggc gtt act gag* gag act gag ttt 1709 
Thr Pro Val Tyr Glu 'Ser Arg Ala Gly Val Thr Glu Glu Thr Glu Phe 
505 510 . 515 

gca gag get gac caa gac ttc agt gat gag aat cgc acc tac cag get 1757 
Ala Glu Ala Asp Gin Asp Phe Ser Asp Glu Asn Arg Thr Tyr Gin Ala 
520 525 530 

tec age ccg gat gag gtc get ctg gtg cag tgg aca gag agt gtg ggc 1805 
Ser Ser Pro Asp Glu Val Ala Leu Val Gin Trp Thr Glu Ser Val Gly 
535 540 545 550 

etc acg ctg gtc age agg gac etc acc tec atg cag ctg aag acc ccc 1853 
Leu Thr Leu Val Ser Arg Asp Leu Thr Ser Met Gin, Leu Lys Thr Pro 
555 ~ 560 1 565 

agt ggc cag gtc etc age ttc tgc att ctg cag ctg ttt ccc ttc acc 1901 
Ser Gly Gin Val Leu Ser Phe Cys lie Leu Gin Leu Phe Pro Phe Thr 
570 575 580 

tec gag age aag egg atg ggc gtc ate gtc agg gat gaa tec acg gca 194 9 
Ser Glu Ser Lys Arg Met Gly Val lie Val Arg Asp Glu Ser Thr Ala 
585 590 595 

gaa ate aca ttc tac atg aag ggc get gac gtg gee atg tct cct ate 1997 
Glu lie Thr Phe Tyr Met Lys Gly Ala Asp Val Ala Met Ser Prq lie 
600 605 610 

gtg cag tat aat gac tgg ctg gaa gag gag tgc gga aac atg get cgc 2045 
Val Gin Tyr Asn Asp Trp Leu Glu Glu Glu Cys Gly Asn Met Ala Arg 
615 620 625 630 

gaa gga ctg egg acc etc gtg gtt gca aag aag gcg ttg aca gag gag 2093 
Glu Gly Leu Arg Thr Leu Val Val Ala Lys Lys Ala Leu Thr Glu. Glu 
635 640 645 
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cag tap cag gac ttt gag age cga tap act caa gec aag ctg age atg . 2141 

Gin Tyr Gin Asp Phe Glu Ser Arg Tyr Thr Gin Ala Lys Leu Ser Met 

650 _\ . 655 660 

cac gac agg tec etc aag gtg gee gcg gta gtc gag age ctg gag agg 2189 . 

His Asp Arg Ser Leu Lys Val Ala Ala Val Val Glu Ser Leu .Glu Arg 
665 670 \ ' 675 

gag atg gaa ctg ctg tgc etc ace ggc gtg gag gac cag ctg cag gca 2237 

Glu Met Glu Leu Leu Cys Leu Thr Gly Val Glu. Asp Gin Leu Gin Ala 

680 "685 • 690 ; 

v , 
■ ' i 1 

gac gtg egg ccc acg ctg gag atg ctg cgc.aac gee ggg ate aag ata .2285 

Asp Val Arg Pro Thr Leu Glu Met Leu Arg Asn Ala Gly lie Lys lie 

695 700 , 705 " " . 710 

'tgg atg eta aca ggc gat aaa etc gag aca get" ace tgc att gec aaa 2333 

Trp Met Leu Thr Gly Asp Lys Leu Glu Thr Ala Thr Cys lie Ala Lys 
' 715 720 725 

agt tea cat etc gtg tct aga aca caa gat att cat att ttc aga cag 2381 

Ser Ser His Leu Val Ser Arg Thr Gin Asp lie His lie Phe Arg Gin 

730. 735 740 

gta ace agt egg gga gag gca cat ttg gag ctgaat gca ttt cga agg 2429 

Val Thr Ser Arg Gly Glu Ala His Leu Glu Leu Asn Ala Phe Arg Arg ' 
745 750 755 

aag cat gat tgt gca eta gtc ata tct ggg gac tct ctg gag gtt tgt 2477 

Lys His Asp Cys Ala Leu Val lie Ser Gly Asp Ser Leu Glu Val Cys 
760 765 770 

eta aag tac tac gag cat gaa ttt gtg gag ctg gee tgc cag tgc cet 2525 

Leu Lys Tyr Tyr Glu His Glu Phe Val Glu Leu Ala Cys Gin Cys Pro 
775 780 785 790 

gee gtg gtt tgc tgc cgc tgc tea ccc ace cag aag gee cgc att gtg 2573 

Ala Val Val Cys Cys Arg Cys Ser Pro Thr Gin Lys Ala Arg lie Val 
795 800 805 

aca ctg ctg cag cag cac aca ggg aga cgc ace tgc gee ate ggt gat 2621 

Thr Leu Leu Gin Gin His Thr Gly Arg Arg Thr Cys Ala lie Gly Asp 

810 815 820 

gga gga "aat gat gtc age atg att cag gca gca gac tgt ggg att ggg 2669 

Gly Gly Asn Asp Val Ser Met lie Gin Ala Ala Asp Cys Gly lie Gly 
825 830 835 

att gag gga aag gag ggt aaa cag gee teg ctg gcg gec gac ttc tec 2717 
lie Glu Gly Lys Glu Gly Lys Gin Ala Ser Leu Ala Ala Asp Phe Ser 
.840 845 850 

ate acg cag ttc egg cac ata ggc agg ctg etc atg gtg cac ggg egg. 2765 
He Thr Gin Phe Arg His He Gly Arg Leu Leu Met Val His Gly Arg 
855 860 865 " 870 

aac age tac aag agg teg gcg gca etc ggc cag ttc gtc atg cac agg 2813 

Asn Ser Tyr Lys Arg Ser Ala Ala Leu Gly Gin Phe Val Met His Arg 
875 " 880 885 
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ggc ctt ate ate tec ace atg cag get gtg'ttt tec tea gtc. ttc tac 2861 

Gly Leu lie He Ser Thr Met Gin Ala Val Phe, Ser Ser Val Phe Tyr 
890 . 895 900 

ttc gca tec' gtc cct ttg tat cagggc ttc etc atg gttj ggg tat gec 2909 

Phe Ala Ser Val Pro Leu Tyr Gin Gly Phe Leu Met Val Gly Tyr Ala " 

905 910 • ' 915 



ace ata tac ace atg ttc cca gtg ttc tec tta gtg ctg gac cag gac 
Thr He. Tyr Thr Met Phe Pro Val Phe Ser Leu Val Leu Asp Gin Asp 

920 925 ' ' " 930 .. : " 



2957 



gtg aag cca gag atg gcg atg etc tac ccg gag ctg tac aag gac etc 
Val Lys Pro Glu. Met Ala Met Leu Tyr Pro Glu Leu Tyr Lys Asp Leu 
935 94 0 94 5 950 



3005 



,acc aag gga aga tec ttg tec ttc aaa acc ttc etc ate tgg gtt tta 
Thr Lys Gly Arg Ser Leu Ser 'Phe Lys Thr Phe Leu lie Trp Val Leu 
- 955 960 965 



3053 



a.ta agt att tac caa ggc ggc ate etc atg tat ggg gec ctg gtg etc 3101 
He Ser He Tyr Gin Gly Gly lie Leu Met Tyr Gly Ala Leu Val Leu 
970 • 975 980 



ttc gag tct gag ttc gtc cac gtg gtg gec ate tec ttc acc gca ctg 
Phe Glu Ser Glu Phe Val His Val Val Ala lie Ser Phe Thr Ala Leu 
985 990 ,995 



3149 



ate ctg acc gag ctg ctg atg gtg gcg ctg acc gtc cgc acg tgg cac 
He Leu Thr Glu Leu Leu Met Val Ala Leu Thr Val Arg Thr Trp His 
1000 1005 1010 



3197 



tgg ctg atg gtg gtg gee gag ttc etc age tta ggc tgc tac gtg tec 
Trp Leu Met Val Val Ala Glu Phe Leu Ser Leu Gly Cys Tyr Val Ser 
1015 1020 1025 1030 



324 5 



tea etc get ttt etc aat gaa tat ttt ggt ata ggc aga gtg tct ttt 
Ser Leu Ala Phe Leu Asn Glu Tyr Phe Gly He Gly Arg Val Ser Phe 
1035 1040 1045 



3293 



gga get ttc tta gat gtt gec ttt ate acc acc gtg acc ttc ctg tgg 
Gly Ala Phe Leu Asp Val Ala Phe lie Thr Thr Val Thr Phe Leu Trp 
. 1050 1055 1060 



3341 



aaa gtg ■ teg gcg ate acc gtg gtc age tgc etc" ccg ctg tat gtc etc 
Lys Val Ser Ala He Thr Val Val Ser Gys Leu Pro Leu Tyr Val ^eu 
1065 1070 1075 



338 9 



aag tac ctg agg cgc aag tct tct cct ccc age tac tgc aag ctg gec 
Lys Tyr Leu Arg Arg Lys Ser Ser Pro Pro Ser Tyr Cys Lys Leu Ala 
1080 1085 1090 



3437 



tec taa ggggctgtgc acccccagcg ggctggcccc agcaccttct gcccttccca 

Ser * 

1095 



3493 



gcaccttgtg cccttgccag tgaaegcagg gtttgecatt gctaccaagc aagcaccaca 3553 
agaaagggag ggtacgccag gcgagcccag ggcacagatg ctgagacagc ctctccttct 3613 
cagtgcaggg acgtcacccc tgccaggcaa gcccagggca cagatgecag gatggcttct 3673 
ccctctcagt gegaggctte acccctgcca ggcaagccca gggcatagat gctgagacag 3733 
cctctccctc teagtgeagg gacgtcaccc ctgccaggca agcccagggc acagaggecg 3793 
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ggacggcctc' tccctctcag. tgtgaggctt cacccatgct aggcaagccc agggcacaga 3853 

tgccgggatg gcccctccct ctcagtgcgg gaaogtcacc cctgccaggd aagccoaggg 3913 

cacagatget gcgatggcct cttcctctta agtgtggggc ct cacccctg cttttctttc 3973 

tttttttgta ttgtcaaaat tgtatttcca tattgaagca gcttgagttt ctac;tgaaaa . 4"03T" 

tgagcccgaa ttatttcact attactgtaa agggttcatc ttactctggc attctgagaa 4093 

ttagactgaa. 'agtttaattt ctgcagttcc ctcatattca gattctttct ttgatgttat 4153 

aacacaaagt cattcctact caaatgtaat aaaattgagg ctccacggag aaaaaaaaaa 4213 

aaaaaaaaaa aaaaaaaa 4231 

<210> 26 i 
<211> 1095 . 

<212> PRT i ' ' 
<213> Homo sapiens 



<400> 26 i 
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• 
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100 
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Asn 


Leu 


Tyr 


Phe 
110 


Leu 


Val 


lie 


Ser 


Cys 
115 


Ser 


Gin 


Phe 


Val 


Pro 
120 


Ala 


Leu 


Lys 


lie 


Gly 
125 


Tyr 


Leu 


Tyr 


Thr 


Tyr 
130 


Trp 


Ala 


Pro 


Leu 


Gly 
135 


Phe 


Val 


Leu 


Ala 


Val. 
140 


Thr 


Met 


Thr 


Arg 


Glu 


Ala 


He 


Asp Glu 


Phe 


Arg Arg 


Phe 


Gin 


Arg 


Asp 


Lys 


Glu 


Val 


Asn 


145 










150 










155 










160 


Ser 


Gin 


Leu 


Tyr 


Ser 
165 


Lys 


Leu 


Thr 


Val 


Arg 
170 


Gly 


Lys 


Val 


Gin 


Val 
175 


Lys 


Ser 


Ser Asp 


lie 


Gin 


Val 


Gly Asp 


Leu 


lie 


lie 


Val 


Glu Lys 


Asn 


Gin 








180 










185 










190 






Arg 


lie 


Pro 


Ser Asp 


Met 


Val 


Phe' 


Leu 


Arg 


Thr 


Ser 


Glu 


Lys 


Ala 


Gly 






195 










200 










205 








Ser 


Cys 
210 


Phe 


He 


Arg 


Thr 


Asp 
215 


Gin 


Leu 


Asp 


Gly 


Glu 
220 


Thr 


Asp 


Trp 


Lys 


Leu 


Lys 


Val 


Ala 


Val 


Ser 


Cys 


Thr 


Gin 


Gin 


Leu 


Pro 


Ala 


Leu 


Gly Asp 


•225 










230 










235 










240 


Leu 


Phe 


Ser 


He 


Ser 


Ala 


Tyr Val Tyr 


Ala 


Gin 


Lys 


Pro 


Gin 


Met 


Asp 










245 










250 










255 




He 


His 


Ser 


Phe 


Glu Gly Thr 


Phe 


Thr 


Arg 


Glu Asp 


Ser Asp 


Pro 


Pro 








260 










265 










270 






He 


His 


Glu 

275 


Ser 


Leu 


Ser 


He 


Glu 
280 


Asn 


Thr 


Leu 


Trp 


Ala 
285 


Ser 


Thr 


He 


Val 


Ala 


Ser 


Gly 


Thr 


Val 


He 


Gly 


Val 


Val 


He 


Tyr 


Thr Gly 


Lys 


Glu 




290 










295 










300 










Thr 


Arg 


Ser 


Val 


Met 


Asn 


Thr 


Ser 


Asn 


Pro 


Lys 


Asn 


Lys 


Val 


Gly Leu 


305 










310 










315 










320 


Leu 


Asp 


Leu 


Glu 


Leu 


Asn Arg 


Leu 


Thr 


Lys 


Ala 


Leu 


Phe 


Leu 


Ala 


Leu 










325 










330 










335 




Val 


Ala 


Leu 


Ser 


He 


Val 


Met 


Val 


Thr 


Leu 


Gin 


Gly 


Phe 


Val 


Gly Pro 








340 










345 










350 






Trp 


Tyr Arg 


Asn 


Leu 


Phe 


Arg 


Phe 


Leu 


Leu 


Leu 


Phe 


Ser 


Tyr 


He 


lie 






355 










360 










365 
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Pro lie Ser Leu Arg Val Asn Leu Asp Met Gly Lys Ala Val Tyr Gly 

370 M 375" . • 380 • . 

Trp Met Met Met Lys Asp Glu Asn lie Pro Gly , Thr Val Val Arg Thr 
"385 [ ~ ~ " 390 ~ " 395 : " ~W 

Ser Thr He Pro Glu Glu Leu Gly Arg ,I,eu Val Tyr Leu Leu Thr Asp. 

. ' 405 410 ( ' 415 

Lys Thr Gly Thr Leu Thr Gin Asn Glu Met- lie Phe lys Arg Leu His 

420 425 430 

Leu Gly Thr Val Ser Tyr Gly Ala Asp Thr Met Asp Glu lie Gin Ser 

435 440 ; »445 

His Val Arg Asp Ser Tyr Ser Gin Met Gin Ser, Gin Ala Gly Gly Asn 

450 455 ' '460. 

Asn Thr Gly Ser Thr Pro Leu Arg Lys Ala Gin Ser Ser Ala Pro Lys 
465 , ' 470 475 480 

Val Arg Lysi Ser Val Ser Ser Arg He His Glu Ala Val Lys Ala lie 

4 85» " 1 4 90 4 95 

Val Leu Cys His Asn Val Thr Pro Val Tyr Glu Ser Arg Ala Gly Val . 

» " 500 505 510 

Thr Glu Glu Thr Glu Phe Ala Glu Ala Asp Gin Asp Phe Ser Asp Glu. 

515. 520 . ' 525 

Asn Arg Thr Tyr Gin Ala Ser Ser Pro Asp Glu Val Ala Leu Val Gin 

530 . 535 s 540 

Trp Thr Glu Ser Val Gly Leu, Thr Leu Val Ser Arg Asp Leu Thr Ser 
545 550 ; ( 555 , 560 

Met Gin Leu L^fS Thr Pro Ser Gly Gin Val Leu Ser Phe Cys He Levi 

565 ' 570 , 575 

Gin Leu Phe Pro Phe Thr Ser Glu Ser Lys Arg Met Gly Val lie Val 

580 585 590 

Arg Asp Glu Ser Thr Ala Glu lie Thr Phe Tyr Met Lys Gly Ala Asp 

595 600 605 

Val Ala Met Ser Pro He Val Glri Tyr Asn Asp Trp Leu Glu Glu Glu 

610 615 , 620 , . 

Cys Gly Asn Met Ala Arg Glu Gly Leu Arg Thr Leu Val Val Ala Lys 
625 » 630 635 640 

Lys Ala Leu Thr Glu Glu Gin Tyr Gin Asp Phe Glu Ser Arg Tyr Thr 

645 650 655 

Gin Ala Lys Leu Ser Met His Asp Arg Ser Leu Lys Val Ala Ala Val 

660 665 ' • 670 

Val Glu Ser Leu Glu Arg Glu Met Glu Leu Leu Cys Leu Thr Gly Val 

675 680' 685 * 

Glu Asp Gin Leu Gin Ala Asp Val Arg Pro Thr Lieu Glu Met Leu Arg 

690 695 700 

Asn Ala Gly He Lys lie Trp Met Leu Thr Gly Asp Lys Leu Glu Thr 
705 710 ( 715 720 

Ala Thr Cys lie Ala Lys Ser Ser Hi's Leu Val Ser Arg Thr Gin Asp 

725 730 7 35 

lie His He Phe Arg Gin Val Thr Ser Arg Gly Glu Ala His Leu Glu 

740 . 745 750 

Leu Asn Ala Phe Arg Arg Lys His Asp Cys Ala Leu Val He Ser Gly 

755 760 765 

Asp Ser Leu Glu Val Cys Leu Lys Tyr Tyr Glu His Glu Phe Val Glu 

770 775 780 

Leu Ala Cys Gin Cys Pro Ala Val Val Cys Cys Arg Cys Ser Pro Thr 
785 790 795 800 

Gin Lys Ala Arg lie. Val Thr Leu Leu Gin Gin His Thr Gly Arg Arg 
805 810 815 

v Thr Cys Ala He Gly Asp Gly Gly Asn Asp Val Ser Met lie Gin Ala 
820 - 825 830 

Ala Asp Cys Gly lie Gly He Glu Gly Lys Glu Gly Lys Gin Ala Ser 
835 840 845 
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Leu Ala 


Ala Asp 


Phe 


Ser 


He Thr 


Gin 


Phe Arg His lie Gly Arg 


Leu 


850 










855 








8*60 


• 








Leu Met 


Val 


His 


Gly Arg Asn Ser 


Tyr Lys 


Arg 


Ser 


Ala 


Ala 


Leu 


Gly 


865 


• • 






870 








97.5 




» 




• 


8 80 


Gin Phe 


Val 


Met 


His Arg 


Gly Leu 


He 


lie 


Ser 


Thr 


Met 


Gin 


Ala 


Val 








885 








8.90 




• 






895 




Phe Ser 


Ser 


Val 


Phe 


Tyr 


Phe Ala 


Ser 


Val 


Pro 


Leu 


Tyr 


Gin 


Gly 


Phe 






900 








905 










910. 






Leu Met 


Val 


Gly 


Tyr 


Ala 


Thr He 


Tyr 


Thr 


Met 


Phe 


Pro 


Val 


Phe 


Ser 




915 








920 


» 








925 








Leu Val 


Leu 


Asp 


Gin 


Asp, 


Val Lys 


Pro 


Glu 


Met 


Ala 


Met 


Leu 


Tyr 


Pro 


930 








935 






i 


940 


i 








Glu Leu 


Tyr 


Lys 


Asp 


Leu 


Thr Lys 


Gly 


Arg 


Ser 


Leu. 


Ser 


Phe 


Lys 


Thr 


945 








950 








955 










960 


Phe Leu 


lie 


Trp 


Val 


Leu 


He Ser 


lie 


Tyr 


Gin 


Gly 


Gly 


He 


Leu 


Met . 








965 








970 










97 5 




Tyr Gly 


Ala 


Leu 


Val 


Leu 


Phe , Glu 


Ser 


Glu 


Phe 


Val 


His 


Val 


Val 


Ala 




980 








985 










990 






lie Ser 


Phe 


Thr 


Ala 


Leu 


He Leu 


Thr 


Glu 


Leu 


Leu 


Met 


Val 


Ala 


Leu 




995 








1000 








1005 






Thr Val 


Arg 


Thr 


Trp 


His 


Trp Leu 


Met 


Val 


Val 


Ala 


Glu 


Phe 


Leu 


Ser 


1010 








1015 








1020 








Leu Gly Cys 


Tyr 


Val 


Ser 


Ser Leu 


Ala 


Phe 


Leu 


Asn 


Glu 


Tyr 


Phe 


Gly 


1025 








1030 






'1035 








1040 


lie Gly Arg 


Val 


Ser 


Phe 


Gly. Ala 


Phe 


Leu 


Asp 


Val 


Ala 


Phe 


He 


Thr 








1045 






1050 








1055 


Thr Val 


Thr 


Phe 


Leu 


Trp 


Lys Val 


Ser 


Ala 


He 


Thr 


Val 


Val 


Ser 


Cys 






1060 






1065 








1070 




Leu Pro 


Leu 


Tyr 


Val 


Leu 


Lys Tyr 


Leu Arg Arg 


Lys 


Ser 


Ser 


Pro 


Pro 




1075 






1080 








1085 . 






Ser Tyr 


Cys 


Lys 


Leu 


Ala 


Ser 


















1090 








1095 


















<210> 27 


























<211> 3255 


























<212> DNA 


























<2,13> Homo sapiens 






















<220> 




























<221> CDS 


























.<222> (1).. 


. (3255) 






















<400> 27 


























atg cca 


eta 


atg 


atg 


tct 


gaa gaa 


ggc 


ttt 


gag 


aat 


gag 


gaa 


agt 


gat 


Met Pro 


Leu 


Met 


Met 


Ser 


Glu Glu 


Gly 


Phe 


Glu 


Asn Glu 


Glu 


Ser 


Asp 


1 






5 








10 










15 




tac cac 


acc 


tta- 


cca 


cga 


gec agg 


ata 


atg 


caa 


agg. 


aaa 


aga 


gga 


ctg 


Tyr His 


Thr 


Leu 


Pro 


Arg Ala Arg 


He Met 


Gin Arg 


Lys 


Arg 


Gly 


Leu 






20 








25 










30 






gag tgg 


ttt 


gtc 


tgt 


gat 


ggc tgg 


aag 


ttc 


etc 


tgt acc 


agt 


tgc 


tgt. 


Glu Trp 


Phe 


Val 


Cys 


Asp Gly Trp 


Lys 


Phe 


Leu 


Cys 


Thr 


Ser 


Cys 


Cys 




35 








■ 40 










45 








ggt tgg 


ctg ata 


aat 


att 


tgt cga 


aga 


aag 


aaa 


gag 


ctg 


aaa 


get 


cgc 


Gly Trp 


Leu 


He 


Asn 


He 


Cys Arg 


Arg 


Lys 


Lys 


Glu 


Leu 


Lys 


Ala 


Arg 



50 55 60 
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aca gta tgg ctt gga tgt cct gaa aag tgt gaa gaa aaa cat ccc agg 240 

Thr Val Trp Leu Gly Cys Pro Glu Lys Cys Glu' Glu Lys His Pro Arg 
65 TO . . 75 80 

aat tct, ata aaa aat caa aaa tac aat gtg ttt ace ttt ata. cct ggg 288 
Asn Ser lie Lys Asn Gin Lys Tyr Asn Val Phe Tho: Phe lie Pro Gly 
85 90 ■ 95 

gtt ttg tat gaa caa ttc aag ttt ttc ttg aat etc tat ttt eta gtg 336 
Val Leu Tyr Glu Gin Phe Lys Phe Phe Leu, Asn Leu. Tyr Phe Leu Val 
100 , 105 iio 

ata tec tgc tea cag ttt gta cca gca ttg aaa ata ggc tat etc tap -384 
lie Ser Cys Ser Gin Phe Val Pro Ala Leu Lys lie Gly Tyr Leu Tyr 
115 • 120 ; 125 ■ ■ 

acp tac tgg get cct ctg gga ;,ttt gtc ttg get gtt act atg aca egg 432 
Thr Tyr Trp Ala Pro Leu Gly Phe Val Leu Ala Val Thr Met Thr Arg 
130 135 140 

1 gaa gca att gat gaa ttt egg cgt ttt cag cgt gac aag gaa gtg aat 480 
Glu Ala lie Asp Glu Phe Arg Arg Phe Gin Arg Asp Lys Glu Val Asn 
14 5 150 155 ( '■ 160 

tea caa eta tat age aag ctt aca gta aga ggt aaa gtg caa gtt aag 528 
Ser Gin Leu Tyr Ser Lys Leu Thr Val Arg Gly Lys Val Gin Val Lys 
165 17 0 ' 17 5 

agt tea gac ata caa gtt gga gac etc ate ata gtg gaa aag aat caa 57 6 
Ser Ser Asp lie Gin Val Gly Asp Leu lie He Val Glu Lys Asn Gin 
180 185 190 

aga att cca teg gac atg gtg ttt ctt agg act tea gaa aaa gca ggt 624 
Arg He Pro Ser Asp Met Val Phe Leu Arg Thr Ser Glu Lys Ala Gly 
195 200 205 

teg tgt ttt att cga act gat caa eta gat ggt gaa act gac tgg aag 672 
Ser Cys Phe He Arg Thr Asp Gin Leu Asp Gly Glu Thr Asp Trp Lys 
210 215 220 * 

ctg aag gtg gca gtg age tgc acg caa cag ctg c£g get ctg ggg gac 720 
Leu Lys Val Ala Val Ser Cys Thr Gin Gin Leu Pro Ala Leu Gly Asp 
225 230 235 240 

ctt ttt tct ate agt get tat gtt tat get cag aaa cca caa atg gac 7 68 
Leu Phe Ser lie Ser Ala Tyr Val Tyr Ala Gin Lys Pro Gin Met Asp 
245 250 255 

att cac agt ttc gaa ggc aca ttt acc agg gaa gac agt gac ccg ccc 816 

He His Ser Phe Glu Gly Thr Phe Thr Arg Glu Asp Ser Asp Pro Pro 

260 2 65 270 

i 

att cat gaa agt etc age ata gaa aat aca ttg tgg gca age acc att 864 

lie His Glu Ser Leu Ser He Glu Asn Thr Leu Trp Ala Ser Thr He 
275 280 285 

gtt gca tea ggt act gta ata ggt gtt gtc att tat acc gga aaa gag 912 
Val Ala Ser Gly Thr Val lie Gly Val Val He Tyr Thr Gly Lys Glu 
290 ** 295 300 
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act cga agt gta atg aac aca tec aat cca aaa aat aag gtt ggt ttg 960 1 
Thr Arg Ser Val Met ilsn Thr«*Ser Asn Pro Lys Asn Lys Vai Gly Leu* 
305 310 315 320 

. • •■ • • ' , t ' 

ttg gac ctt gaa etc aat egg ctg acg ,aaa gcg eta ttt ttg get tta 1.008 

Leu Asp Leu' Glu Leu Asn Arg Leu Thr Lys Ala Leu Phe Leu Ala Leu 

325 330- 335 

gtt get ctt tec att gtt atg gta acc tta caa gga ttt gtg ggt cca 1056 
Val Ala Leu Ser lie Val Met Vai .-Thr Leu Gin Gly Phe Val Gly Pro , 
340 f - 345 ■ 350 

tgg tac cgc aat ctt ttt cg§ ttc ctt .etc etc ttt tct tac ate att 1104 
Trp Tyr Arg Asn 'Leu Phe Arg Phe Leu Leu Leu Phe Ser Tyr lie lie 
355i • 360 365 ' 

ccc ata agt ttg cgt gtg aac ttg gac atg ggc aaa gcg gtg tat gga 1152 
Pro lie Ser Leu. Arg Val Asn Leu Asp Met Gly Lys Ala Val Tyr Gly 
370 375 380 

tgg atg. atg atg aaa gat gag aac ate cct ggc acg gtc gtt egg acc 1200 
Trp Met Met Met Lys Asp Glu Asn lie Pro Gly Thr Val Val Arg Thr 

385 • 390 , ♦ : . 395 400 • 

age act ate cqa gag gaa ctt ggg cgc ctg gtg tat tta ttg aca gac 1248 
Ser Thr lie Pro Glu 'Glu Leu Gly Arg Leu Val Tyr Leu Leu Thr Asp 
405 ' 410 415 

aaa aca gga acc etc acc cag aat gaa atg ata. ttt aag egg ctg cac 1296 
Lys Thr Gly Thr Leu Thr Gin Asn Glu Met lie Phe Lys Arg Leu His 
420 425 ( 430 

ctg ggc acc gtg tec tat ggc gee gac acg atg gat gag ate cag age 1344 
Leu Gly Thr Val Ser Tyr Gly Ala Asp Thr Met Asp Glu lie Gin Ser 
435 440 445 

cat gtc agg gac tec tac tea cag atg caa; tct caa get ggt gga aac 1392 
His Val Arg Asp Ser Tyr Ser Gin Met Gin Ser Gin. Ala Gly Gly Asn 
450 ~ 455 t 460 

aat act ggt tea act cca eta aga aaa gee caa tat tea get ccc aaa 1440 
Asn Thr Gly Ser Thr Pro Leu Arg Lys Ala Gin Ser Ser Ala Pro Lys 
465 470 475 480 

gtt agg aaa agt gtc agt agt cga ate cat gaa gee gtg aaa gee ate 1488 
Val Arg Lys Ser Val Ser Ser Arg lie His Glu Ala Val Lys Ala lie 
485 490 495 

gtg ctg tgt cac aac gtg acc ccc gtg tat gag tct egg gee ggc gtt 1536 
Val Leu Cys His Asn Val Thr Pro Val Tyr Glu Ser Arg Ala Gly Val 
500 505 510 

act gag gag act gag ttc gca gag get gac caa gac ttc agt gat gag 1584 
Thr Glu Glu Thr Glu Phe Ala Glu Ala Asp Gin Asp Phe Ser Asp Glu 
515 520 525 

. aat cgc acc tac cag get tec age ccg gat gag gtc get ctg gtg cag 1632 
Asn Arg Thr Tyr Gin Ala Ser Ser Pro Asp Glu Val Ala Leu Val Gin 
530 535 540 
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tgg aca gag agt gtg ggc etc acg ctg gtc age agg gac etc ace tec . 1680 
Trp Thr Glu Ser Val Gly Leu Thr Leu Val Sex Arg Asp Leu Thr Ser - 
545 - 550 . 555 560 

atg cag ctg aag acc ccc agt ggc cag gtc etc age ttc tgc att ctg 1728 
Met Gin Leu Lys Thr Pro Ser Gly Gin Val Leu Se,r Phe Cys.Ile Leu 
565 570 , 575 

cag ctg ttt ccc ttc ace tec gag age aag egg atg ggc gtc ate gtc . 1776 

Gin Leu Phe Pro Phe Thr Ser Glu Ser Lys Arg Met Gly Val lie Val . 
580 , : 585 590 

i 1 . 

agg gat gaa tec acg gca gaa ate aca ttc tac atg aag ggc get gac . 1824 

Arg Asp Glu Ser Thr Ala Glu lie Thr Phe Tyr Met Lys Gly Ala Asp 

595 i 600 605 

'gt;g gee atg tct cct ate gtg cag tat aat gac tgg ctg gaa gag gag 1872 
Val Ala Met Ser Pro lie Val Gin Tyr Asn Asp Trp Leu Glu Glu Glu 
610 615 . 620 

t'gc gga aac atg get cgc gaa gga ctg egg ace etc gtg gtt gca aag 1920 
Cys Gly Asn Met Ala Arg Glu Gly Leu Arg Thr Leu Val Val Ala Lys 
625 630 635 -640 

aag gcg ttg aca gag gag cag tac cag gac ttt gag age cga tac act 1968 
Lys Ala Leu Thr Glu Glu Gin Tyr Gin Asp Phe Glu Ser Arg Tyr Thr 
645 650 « 655 

caa gee aag ctg age atg cac gac agg tec etc aag gtg gee gcg gta 2016 
Gin Ala Lys Leu Ser Met His Asp Arg Ser Leu Lys Val Ala Ala Val 
660 665 . 670 

gtc gag age ctg gag agg gag atg gaa ctg ctg tgc etc acc ggc gtg 2064 
Val Glu Ser Leu Glu Arg Glu Met Glu Leu Leu Cys Leu Thr Gly Val 
675 680 685 

gag gac cag ctg cag gca gac gtg egg ccc acg ctg gag atg ctg cgc 2112 
Glu Asp Gin Leu Gin Ala Asp Val Arg Pro Thr Leu Glu Met Leu Arg 
690 695 700 

aac gee ggg ate aag ata tgg atg ct'a aca ggc gat aaa etc gag aca 2160 
.Asn Ala Gly He Lys He Trp Met Leu Thr Gly Asp Lys Leu Glu Thr 
. 705 710 715 720 

get acc" tgc att gec aaa agt tea cat etc gtg tct aga aca caa gat 2208 
Ala Thr Gys lie Ala Lys Ser Ser His Leu Val Ser Arg Thr Gin Asp 
725 730 735 

att cat att ttc aga cag gta acc agt egg gga gag. gca cat ttg gag 2256 
lie His lie Phe Arg Gin Val Thr Ser Arg Gly Glu Ala His Leu Glu 
740 745 750 

ctg aat gca ttt cga agg aag cat gat tgt gca eta gtc ata tct ggg 2304 
Leu Asn Ala Phe Arg Arg Lys His 'Asp Cys Ala Leu Val lie Ser Gly 
755 760 ~ 765 

gac tct ctg gag gtt tgt eta aag tac tac gag cat gaa ttt gtg gag 2352 
Asp Ser Leu. Glu Val Cys Leu Lys Tyr Tyr Glu His Glu Phe Val Glu 
770 775 780 
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ctg gcc tgc cag tgc cct gcc gtg gtt tgc tgc cgc tgc tea ccc acc 2400 

Leu Ala Cys Gin Cys » Pro Ala »Val Val- Cys Cys Arg Cys Set Pro Thr ' 

785 790 795 -800 

' ' '■ ' . r 

cag aag gcc cgc att gtg aca ctg ctg .cag cag cac aca ggg aga cgc 2,448 

Gin Lys Ala' Arg He Val Thr Leu Leu . Gin Gin His . Thr. Gly Arg Arg 

805 810- 815 

acc tgc gcc ate ggt gat gga gga aat gat gtc age atg att cag gca 2496 
Thr Cys Ala He Gly Asp Gly Gly.Asn Asp Val Ser' Met He Gin Ala , 
820 • 825 • 830 

* 

gca gac tgt ggg att ggg att gag gga aag gag ggt aaa cag gcc teg 254 4 
Ala Asp Cys Gly 'He Gly He Glu Gly Lys Glu Gly Lys Gin Ala Ser 
835« ' 840 845 ' 

ctg gcg gcc gac ttc tec ate acg cag ttc egg cac ata ggc agg ctg 2592 

Leu Ala Ala Asp Phe Ser He Thr Gin Phe Arg His He Gly Arg Leu 

850 . 855 " 860 ' 
. i 

etc atg gtg cac ggg egg aac age tac aag agg teg gcg gca etc ggc 2640 

Leu Met Val His Gly Arg Asn Ser Tyr Lys Arg Ser Ala Ala Leu Gly 

865 870 , . " . " 875 880 



cag ttc gtc atg cac agg ggc ctt. ate ate tec ace atg cag get gtg 
Gin Phe Val Met His * Arg Gly Leu He He Ser Thr Met Gin Ala Val 
885 '890 895 



acc gtc cgc acg tgg cac tgg ctg atg gtg gtg gcc gag ttc etc age 
Thr Val Arg Thr Trp His Trp Leu Met Val Val Ala Glu Phe Leu Ser 
1010 1015 1020 



-101- 



2688 



ttt tec tea gtc ttc tac ttc gca tec gtc cct ttg tat cag ggc ttc 2736 
Phe Ser Ser Val Phe Tyr Phe Ala Ser Val Pro Leu Tyr Gin Gly Phe 
900 905 , 910 

etc atg gtg ggg tat gcc acc ata tac acc atg ttc cca gtg ttc tec 2784 
Leu Met Val Gly Tyr Ala Thr He Tyr Thr Met Phe Pro Val Phe Ser 
915 920 925 

tta gtg ctg gac cag gac gtg aag cca gag atg. gcg atg etc tac ccg 2832 
Leu Val Leu Asp Gin Asp Val Lys Pro Glu Met Ala- Met Leu Tyr Pro 
930 935 940 

gag ctg tac aag gac etc acc aag gga aga tec ttg tec ttc aaa acc 2880 
Glu Leu Tyr Lys Asp Leu Thr Lys Gly Arg Ser Leu Ser Phe Lys Thr 
945 - 950 955 960 

ttc etc ate tgg gtt tta ata agt att tac caa ggc ggc ate etc atg 2928 
Phe Leu lie Trp Val Leu lie Ser lie Tyr Gin Gly Gly lie Leu Met 
965 970 975 

tat ggg gcc ctg gtg etc ttc gag tct gag ttc gtc cac gtg gtg gcc 2976 
Tyr Gly Ala Leu Val Leu Phe Glu Ser Glu Phe Val His Val Val Ala 
980 985 990 

ate tec ttc acc gca ctg ate ctg acc gag ctg ctg atg gtg gcg ctg 3024 
He Ser Phe Thr Ala Leu He Leu Thr Glu Leu Leu Met *Val Ala Leu 
995 1000 1005 
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tta ggc tgc tac gtg tpc tea etc get ttt etc aat gaa tat ttt gat 3120 

Leu Gly Cys Tyr Val , Ser 'Ser »• Leu Ala Rhe «Leu. Asn Glu Tyr Phe Asp- 
1025 ' 1030 1035 .1040 

• - ; . , . , r 

gtt gec ttt ate acc acc gtg acc ttc;ctg tgg aaa gtg teg gcg ate 3168 

Val Ala Phe* lie Thr Thr Val Thr Phe Leu Trp Lys Val Ser Ala Jle 

1045 1050 . 1055 . 

acc gtg gtc age tgc etc ccg ctg tat gtc etc aag tac ctg agg cgc 3216 

Thr Val Val Ser Cys Leu Pro Leu Tyr Val Leu Lys« Tyr Leu Arg Arg 
1060 , ' 1065 , 1070 

I . * ' • ' • • 

aag tct tct cct ccc age tac tgc aag ctg gee tec taa 3255 

Lys Ser Ser Pro' Pro Ser Tyr Cys Lys Leu. Ala Ser *. 

1015 . 1080 • 



<210> 28 , 
<211> 464 
<212> PRT , 

<213> Escherichia coli 



<400> 28 

Met Pro Asp Ala Lys Lys Gin Gly Arg Ser Asn Lys Ala, Met Thr Phe 
1 ■ 5 10* 15 1 

Phe Val Cys Phe Leu ' Ala Ala Leu Ala Gly Leu Leu Phe ,Gly Leu Asp 

20 25 30 

lie Gly Val He Ala Gly Ala Leu Pro Phe He Ala Asp Glu Phe Gin 

35 40 45 

He Thr Ser His Thr Gin Glu Trp Val Val Ser Ser Met Met Phe Gly 

50 55 60 

Ala Ala Val Gly Ala Val Gly Ser Gly T.rp Leu Ser, Phe Lys Leu Gly 
65 70 75 80 

Arg Lys Lys Ser Leu Met lie Gly Ala He Leu Phe Val Ala Gly Ser 

85 90 95 

Leu Phe Ser Ala Ala Ala Pro Asn Val Glu Val Leu He Leu Ser Arg 

100 105 110 

Val Leu Leu Gly Leu Ala Val Gly Val Ala Ser Tyr Thr Ala Pro Leu 

li5 120 l'25 

Tyr Leu Ser Glu He Ala Pro Glu Lys He Arg Gly Ser "Met He Ser 

130 135 140 

Met Tyr Gin Leu Met lie Thr He Gly He Leu Gly Ala Tyr Leu Ser 
145 150 155 160 

Asp Thr Ala Phe Ser Tyr Thr Gly Ala Trp Arg Trp Met Leu Gly Val 

165 • 170 175 

He lie lie Pro Ala lie Leu Leu Leu He Gly Val Phe Phe Leu Pro 

180 185 190 

Asp Ser Pro Arg Trp Phe Ala Ala Lys Arg Arg Phe Val Asp Ala Glu 

195 200 205 

Arg Val Leu Leu Arg Leu Arg Asp Thr Ser Ala Glu Ala Lys Arg Glu 

210 215 220 

Leu Asp Glu lie Arg Glu Ser Leu Gin Val Lys Gin Ser Gly Trp Ala 
225 230 235 240 

Leu Phe Lys Glu Asn Ser Asn Phe Arg Arg Ala Val Phe Leu Gly Val 

245 250 255 

Leu Leu Gin Val Met Gin Gin Phe Thr Gly Met Asn Val He Met Tyr 

260 265 270 

Tyr Ala Pro Lys He Phe Glu Leu Ala Gly Tyr Thr Asn Thr Thr Glu 

275 280 285 

Gin Met Trp Gly Thr Val He Val Gly Leu Thr Asn Val Leu Ala Thr. 
290 295 300 
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Phe 


He 


Ala 


He 


Gly Leu Val Asp Arg Trp Gly Arg Lys Pro 


Thr Leu 


305 










310 - 


315' 




320 


Thr Leu Gly 


Phe 


Leu Val Met Ala 


Ala Gly Met 


Gly Val Leu 


Gly Thr 










325 




• 330 


• 


335 


Met 


Met 


His 


He 


Gly 


He His Ser 


Pro Ser Ala 


Giii^Tyr Phe 


Ala lie 








340 






345 . 


, 350 




Ala 


Met 


Leu 


Leu 


Met 


Phe He Val 


Gly Phe Ala 


Met Ser Ala 


Gly Pro 






355 






360 




365 




Leu 


He 


Trp Val 


Leu 


Cys Ser Glu He Gin Pro Leu Lys Gly Arg Asp 




370 








375 




380 




Phe 


Gly 


He 


Thr 


Cys 


Sen Thr Ala 


Thr Asn Trp 


He Ala Asn 


Met He 


385 










390 


395 


» 


400" 


Val 


Gly 


Ala 


Thr 


Phe 


Leu Thr Met 


Leu Asn Thr 


Leu Gly Asn Ala Asn 








4 05 




410 




415 


Thr 


Phe 


Trp 


Val 


Tyr 


Ala Al'a Leu 


Asn Val Leu 


Phe He Leu 


Leu Thr 








420 






4 25 


430 




Leu 


Trp 


Leu 


Val 


Pro 


Glu Thr ., Lys 


His Val Ser 


Leu Glu His 


He Glu 






435 






440 




445 




Arg 


Asn 


Leu 


Met 


Lys 


Gly Arg Lys 


Leu Arg Glu 


He Gly Ala 


His Asp 




450 








455 




460 





<210> 29 
<211> 472 
<212> PRT 

<213> Escherichia coli 



<400> 29 



Met 


Val 


Thr 


He 


Asn 


Thr Glu 


Ser 


Ala 


Leu 


Thr 


Pro 


Arg 


Ser 


Leu 


Arg 


1 








5 








10 










15 




Asp 


Thr 


Arg 


Arg 


Met 


Asn Met 


Phe 


Val 


Ser 


Val 


Ala 


Ala 


Ala 


Val 


Ala 








20 








25 










30 






Gly 


Leu 


Leu 


Phe 


Gly 


Leu Asp 


He 


Gly Val 


He 


Ala 


Gly Ala 


Leu 


Pro 






35 








40 










45 








Phe 


He 


Thr 


Asp 


His 


Phe Val 


Leu 


Thr 


Ser 


Arg 


Leu 


Gin 


Glu 


Trp 


Val 




50 








55 










60 










Val 


Ser 


Ser 


Met 


Met 


Leu Gly Ala Ala 


He 


Gly Ala 


Leu 


Phe 


Asn 


Gly 


65, 










70 








75 










80 


Trp 


Leu 


Ser 


Phe 


Arg 


Leu Gly Arg 


Lys 


Tyr 


Ser 


Leu 


Met 


Ala 


Gly Ala 










85 








90 










95 




He 


Leu 


Phe 


Val 


Leu 


Gly Ser 


He 


Gly 


Ser 


Ala 


Phe 


Ala 


Thr 


Ser 


Val 








100 








105 










110 






Glu 


Met 


Leu 


lie 


Ala 


Ala Arg 


Val 


Val 


Leu 


Gly 


He 


Ala 


Val 


Gly 


He 






.115 








120 










123 








Ala 


Ser 


Tyr 


Thr 


Ala 


Pro Leu 


Tyr 


Leu 


Ser 


Glu 


Met 


Ala 


Ser 


Glu 


Asn 




130 








135 










140 










Val 


Airg 


Gly 


Lys 


Met 


He Ser 


Met 


Tyr 


Gin 


Leu 


Met 


Val 


Thr 


Leu Gly 


145 










150 








155 










160 


He 


Val 


Leu 


Ala 


Phe 


Leu Ser 


Asp 


Thr 


Ala 


Phe 


Ser 


Tyr 


Ser 


Gly Asn 










165 








170 










175 




Trp 


Arg 


Ala 


Met 


Leu 


Gly Val 


Leu 


Ala 


Leu 


Pro 


Ala 


Val 


Leu 


Leu 


He 




180 








185 










190 






He 


Leu 


Val 


Val 


Phe 


Leu Pro 


Asn 


Ser 


Pro 


Arg 


Trp 


Leu 


Ala 


Glu 


Lys 






195 








200 










205 








Gly 


Arg 


His 


He 


Glu 


Ala Glu 


Glu 


Val 


Leu 


Arg 


Met 


Leu 


Arg 


Asp 


Thr 




210 








215 










220 










Ser 


Glu 


Lys 


Ala Arg 


Glu Glu 


Leu 


Asn 


Glu 


He 


Arg 


Glu 


Ser 


Leu 


Lys 


225 










230 








235 










240 


Leu 


Lys 


Gin 


Gly Gly 


Trp Ala 


Leu 


Phe 


Lys 


He 


Asn Arg 


Asn 


Val 


Arg 










245 








250 










255 
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Arg 


Ala 


Val 


Phe 


Leu 


Gly Met Leu 


Leu 


Gin 


Ala Met 


Gin 


Gin 


Phe Thr 






260 






265 








270 




Glv 


Met 


Asn 


He 


lie 


Met Tyr Tyr 


Ala 


Pro Arg He 


Phe 


Lys 


Met Ala 






275- 






280 






■ • 


28^ 






Glv 


Phe 

T 


Thr 


Thr 


Thr 


Glu Gin Gin 


Met 


lie 


Ala Thr 


Leu 


Val 


Val Gly 


290 








295 






300 








Leu 


Thr 


Phe 


Met 


Phe 


Ala Thr Phe 


He 


Ala 


Val Phe 


Thr 


Val 


Asp Lys 


305 










310 






315 






320 


Ala 


Gly 


Arg 


Lys 


Pro 


Ala Leu Lys 


He 


Gly 


Phe Ser 


Val 


Met 


Ala Leu 








325 






33Q 








335 


Gly 


Thr 


Leu 


Val 


Leu 


Gly Tyr Cys 


Leu 


Met 


Gin Phe Asp Asn Gly Thr 








340 






34 5 






i ■ 


350 




Ala 


Ser 


Ser Gly 


Leu 


Ser Trp Leu 


Ser 


Val 


Gly Met 


Thr 


Met 


Met Cys 






355 






360 








365 






lie 


Ala 


Gly 


Tyr 


Ala 


Met Ser Ala 


Ala 


Pro 


Val Val 


Trp 


He 


Leu Cys 




370 








375 






,'380 








*Se,r 


Glu 


He 


Gin 


Pro 


Leu Lys ,Cys 


Arg 


Asp 


Phe Gly 


lie 


Thr 


Cys Ser 


385 










390 






395 






400 


Thr 


Thr 


Thr 


Asn 


Trp 


Val Ser Asn 


Met 


lie lie Gly Ala 


Thr 


Phe Leu 










405 






410 








415 


Thr 


Leu 


Leu 


Asp 


Ser 


lie Gly Ala 


Ala 


Gly 


Thr Phe 


Trp 


Leu 


Tyr Thr 








420 






4 25 








430 




Ala 


Leu 


Asn 


lie 


Ala 


Phe Val Gly 


He 


Thr 


Phe Trp 


Leu 


lie 


Pro Glu 






435 






440 








445* 






Thr 


Lys 


Asn 


Val 


Thr 


Leu Glu His 


He 


Glu 


Arg Lys 


Leu 


Met 


Ala Gly 




450 








455 






4 60 








Glu 


Lys 


Leu 


Arg 


Asn 


He Gly Val 








i. 






465 










470 















<210> 30 
<211> 526 
<212> PRT 

<213> Homo sapiens 



<400> 30 



Met 


Thr 


Ser 


Asp 


His 


Glu 


His 


Met 


Thr 


Ala 


Val Cys 


Ala 


Ser 


His 


Val 


1, 








5 










10 








15 




Gin 


Thr 


His 


Gly 


Ser 


Gin 


Leu 


Gin 


He 


Gin 


Lys Leu 


Ser 


Pro 


Cys 


Phe 








20 










25 








30 






Arg 


Pro 


Pro 


Thr 


Pro 


Ala 


Phe 


Arg 


He 


Ser 


Ser S;er 


He 


He 


Leu 


Leu 




35 










4 0 








45 








Gly 


Ala 


Gly 


Leu 


Ala 


Gly 


Pro 


Ser 


Thr 


Gly Asp Arg 


Trp 


Phe 


Gly 


Val 




50 










55 








■- 60 










Ser 


Val 


Val 


Gly 


Thr 


Gly 


Leu 


Phe 


Leu 


Pro 


Pro Leu 


Gin 


Leu 


Leu 


Leu 


65 










70 










75 








80 


Pro 


Pro 


Arg 


Leu 


Leu 


Phe 


Thr 


His 


Ala 


He 


Leu Glu 


Arg 


Leu 


His 


Leu 










85 










90 












Trp 


Leu 


Ala 


Leu 


Pro 


Pro 


Val 


Leu 


Val 


Leu 


Gly His 


Ala 


Leu 


Leu 


His 






100 










105 








110 






Cys 


Lys 


Val 


Gly 


Gly Ser 


Thr 


Ala, 


Arg Ala 


Gly Asp Gin Leu 


Val 


Gin 




* 


115 










120 








125 








Arg 


Val 


Leu 


Leu 


Leu 


He 


Val 


Phe 


Leu 


His 


Arg Trp 


Val 


Gin 


Val 


Trp 




130 










135 








140 










Pro 


Glx 


Gly 


Thr 


Glu Val Asfi He Leu Gly Met Gly Ser Arg Thr Gly 


145 










150 










155 








160 


Gly 


Arg 


Arg 


Gly 


Pro 


Glu 


Leu 


Arg 


Pro 


Gly 


Phe Arg 


He 


Ser 


lie 


Leu 










165 










170 








175 




Ser 


Ala 


Tyr 


He 


Ser 


Asn 


Tyr 


Ala 


Phe 


Ala 


Asn Val 


Phe 


His 


Gly Trp 








180 










185 








190 
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Lys 


Tyr 


Met 


Phe Gly Leu Val lie 


Pro 


Leu 


Gly Val 


Leu- 


Gin Ala 


He 




195 




i » 


» 


200 




r 


Phe 




205 






* 


Ala. 


Met 


Tyr 


Phe 


Leu 


Pro 


Pro 


Ser 


Pro 


Arg 


Leu 


Val 


Met 


Lys .Gly 




210 








■ 


21,5 










220 










Gin 


Glu 


Gly 


Ala 


Ala 


Ser 


Lys 


Val 


Leu Gly Arg Leu Arg 


Ala 


Leu 


Ser 


225 










230 










235 










240. 


Asp 


Thr 


Thr 


Glu 


Glu 


Leu 


Thr 


Val 


He 


Lys. 


Ser 


Ser 'Leu 


Lys 


Asp 


Glu 










245 










250 










255 




Tyr 


Gin 


Tyr 


Ser 


Phe 


Trp 


Asp Leu 


Phe Arg 


Ser 


Lys 


Asp Asn Met 


Arg 








260 










265 










270 






Thr Arg 


He 


Met 


He 


Gly 


Leu 


Thr' 


Leu 


Val 


Phe, 


Phe 


Val 


Gin 


lie 


Thr 






275 








i 


280 


i 








285 








Gly Gin 


Pro 


Asn 


He 


Leu 


Phe 


Tyr 


Ala 


Ser 


Thr 


Val 


Leu 


Lys 


Ser 


Val 




290 










295 










300 










Gly 


Phe 


Gin 


Ser 


Asn .Glu 


Ala 


Ala 


Ser 


Leu 


Ala 


Ser 


Thr 


Gly Val 


Gly 


305 








i 


310 




i 






315 










320 


Val 


Val 


Lys 


Val 


He 
325 


Ser 


Thr 


He 


Pro 


Ala 
330 


Thr 


Leu 


Leu 


Val 


Asp 
335 


His 


Val 


Gly 


Ser 


Lys 


Thr 


Phe 


Leu Cys 


He 


Gly Leu 


Leu 


Asn 


Ala 


Gly 


Leu 








34 0 










345 










350 






Ser 


His 


Thr 


Glu 


Tyr 


Gin 


He 


Val 


Thr 


Asp 


Pro. 


Gly Asp Vdl 


Pro 


Ala 






355 










360 










365 








Phe 


Leu 


Lys 


Trp 


•Leu 


Ser 


Leu }kla 


Ser 


Leu 


Leu 


Val 


Tyr 


Val 


Ala 


Ala 




370 










375 










380 










Phe 


Ser 


lie 


Gly 


Leu 


Gly 


Pro 


Met 


Pro 


Trp Leu Val" Leu Ser 


Glu 


life 


385 






i 




390 










395 










400 


Phe 


Pro 


Gly 


Gly 


lie 


Arg 


Gly Arg 


Ala 


Met 


Ala 


Leu 


Thr 


Ser 


Ser 


Met 










405 










410 










415 




Asn 


Trp 


Gly 


He 
420 


Asn 


Leu 


Leu 


He 


Ser 
425 


Leu 


Thr 


Phe 


Leu 


Thr 
430 


Val 


Asn 


Leu 


He 


Gly 


Leu 


Pro 


Trp 


Val 


Cys 


Phe 


He 


Tyr 


Thr 


He 


Met 


Ser 


Leu 






435 










4 40 








4 45 








Ala 


Ser 
450 


Leu 


Leu 


Phe 


Val 


Val 
455 


Met 


Phe 


lie 


Pro 


Glu* 
4 60 


Thr 


Lys 


Gly 


Cys 


Ser 


Leu 


Glu 


Gin 


lie 


Ser 


Met 


Glu 


Leu 


Ala 


Lys 


Val 


Asn 


Tyr 


Val 


Lys 


465 










470 










475 










480 


Asn 


Asn 


He 


Cys 


Phe 
485 


Met 


Ser 


His 


His 


Gin 
4 90 


Glu 


Glu 

• 


Leu 


Val 


Pro 
495 


Lys 


Gin 


Pro 


Gin 


Lys 
500 


Arg 


Lys 


Pro 


Gin 


Glu 
505 


Gin 


Leu 


Leu 


Glu 


Cys 
510 


Asn 


Lys 


Leu 


Cys 


Gly Arg 


Gly Gin 


Ser 


Arg 


Gin 


Leu 


Ser 


Pro 


Glu 


Thr 










515 










520 










525 









<210> 31 
<211> 25002 
<212> DNA 

<213> Caenorhabditis elegans 



<400> 31 

accctgcaat 

tgttgacaaa 

cacagaaagt 

gttgtaaaac 

aaaagaaatg 

tgtcttattt 

gtggaagttg 

gttttaagaa 

tgtttttttt 

taagcagaat 

ctggaatgcc 



aggatatcat 
ttttcaaata 
gagggcgatc 
atttttccgt 
tatggatgga 
tttcagatat 
gaatgaatcc 
gtgaagtatt 
ttaaaataca 
gattgaaaaa 
tcctggttac 



tacgatgtaa 
atgaaatgtg 
ttcattttaa 
cgaacaatta 
acagaaccag 
gtcgaagcgt 
aatggcaatt 
gaatcaaaga 
tctattaaat 
ttaacatttg 
agagtaagtt 



gtattcaata 
aatgcttcat 
ctgcaaaatt 
tcacgggaag 
ttggtaagtt 
ctttgccgag 
caagcagaag 
gaatcgctag 
ttcagttgat 
tggaaagtaa 
aatgattttt 



tttatttcaa 
gaattcttaa 
tcaggtacca 
aaaattcaca 
tcttttaatt 
cttgtcgctt 
ttaaaactga 
gggttgtttc 
agtaaccgag 
ggttgaacca 
ttttaatttg 



atttgtattt 60 
gctttcgggg 120 
tcatgtaatg 180 
tgtgctaaac. 240 
ttaagggtaa 300 
tgcaaaatgt 360 
tgaaggcaaa 420. 
atctgtaagt 4 80 
gaagatttgt 540 
ttacatcgtt 600 
caagaacttt 660 
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aattgatttc' aggacataag. aaaattggaa gaaattcttg attcaaaacc agtacttgtt 720* 
gttacggata ttcccaattt ga'a'attctgc ccttcaccat gtgcgaatga" aaaaaggtga 780 

. qttttttatt aaaatgagat tgaaaatttt cccacttctt tagaccaaga c gacatttca 84 0 / 
ccaactacgt tcacacaagt tacqtggctt ctcttgaatc ttcaaaaatg tttgaa^tttt ,900 
cttcacaact tgatttggaa tccagaattc ttt;t;aatgaa acacgcgaca ctaatttgtt 960 
cgaatatgat bjaatgcattc ttttcgatta acgaaatgaa atcaga'tatt cttaggcatc 1020 
cggatgggag catgtcagga catatgagaa aattcgatcg agagaatgat gtcatgacag 1080 
atcatgtcaa attgatacag aagacattga tcactttcct aaatcataaa gtcgacaaaa 1140 
ttgaatatct tttattcaaa gctattatgc tttgcaatcc aggtattttc cttgccgtca 1200 
tatatataat ttaaacgatt acttgaaatt tcagctgtgc ccggattaga ttcctggaac 1260 ■ 
caagaaatca tcgaaaagga acgaaatcaa tacgttaaag. ctcttctaaa ttattgtctc 1320 
cttcaacatg gaaaacttca aggcccaaca agattcgcaa 'ctattctcgc aatggctcca 1386 t 
attatagaga atcaatcgaa aaatcaaaaa gatttccatg tgtacattaa agcaaagcat 1440 
ttccagaagc acaaa'aagat gggaacaacg ttcagaaaat gtatcagttg catgtttgat 1500 
caaattatgg eaacataaat aatcaatgta aataaatgac aatgtttttg tttaactgtt 1560 
aatataatct gttcatcctc tgtata f atat tctacctcaa aatacaaaaa taaacatctc 1620 
ataatttatt cccctaattc aaac.tacaaa gcatgatgga aactctatac attagtcgtt 1680 
tgtcgtttpt tgtggacgtt . tcatgtggtg ctgaagttag gaatttgggc gatttaccta 1740 
cttgatgtaa cttctaagtt taacctgaca tttgcaattt taagaatttt ccaagcatta 1800 1 
taaaaaaaat .agttcacact tttcatgtgc gagtagatga taattatgca agttcaatct I860 
tgcacaggat taaacctact agttcacttc tgatttttat atattaactt 1 atcaatgtct 1920 
tatttttgaa gatttttcgt gtgacagagg tttcaaaaaa aagagatatt caatcagaat 1980 
tctaaaagac aggttgtgaa acttqaatat tttcgaatat accgcacttg tagtgaattc 2040 
agaatataat tttcaattta aaa'aacacaa tttttgaagg tacgcacqaa tcacgcatct 2100' 
gaaaatti:aa gaaactacac atatatcagc aattgatctt caaaattata acgacaaaac 2160 
tgaacttcaa actttcaaaa attccagaaa ttaaaacgac tgaagtgatt cttgcagagt 2220 
gttgactcaa agtgaccaat tggagaccat acttgttaat tctttcaaaa acattttgat 2280 
cgtcgcctgt tacgtataca aaagaagagt actgattcat caaatttgga atataatccc 2340 
acccacatgt aagtgtcgat agttgttgaa tcggctgaaa actttttccg aaacagaaag 24 00 
gagaaagaac aaacatttga tagaggcgat agtcacatca ttttccactc tttttctcta 24 60 
ccatcctatc tctaatcagt ctagtcttca ctaaatcaac cgtttttata ttccataaat 2520 
tgatccatta atgtcttcga aggataggta gatatattta tcttgcgttt attttatttt 2580 
tccataatat tccagtgatt ctccaccaga aacttttgaa gtgtgcatct cctcaccaga 2640 
aatacacaaa aaagaaataa tgactccacc acagaaatca aaaccaaaaa gaaatcgacg 2700 
gaaaatttac aaaattcttc cacaaaatca atgtccatcg gtgtgtcaaa tttgtagaaa 2760 
tccagcaatt ggatatcatt atgaggtaaa ctacagtata gtttgtttta ttgttagaag 2820 
tagtgtaaag gttcactaga gctactttaa agattacgat tcgatgagtt attcgattta 2880 
ctacatgaac gctaaactga gaatatgact ttaaaaaatg ttcaactttt cggttttgag 2940 
aaaaatcaaa aggcggacac . gaaacacgat tccacaattt aagttttcaa tactattctg 3000 
ccttggaaaa tttatttggt agaaaagtct acatacaaac ccaaaaat'ca gtataaaaat 3060 

. taagcactct aaaagagtct cacgttttag gtaccatctt gcaatggatg caaaacattc 3120 
ttccgtcgta caattatcac tggccgtaaa tttaaatgtt tcaaagttag taattgtctt 3180 
gatggtaatg acgtaatcgg tgagttcaat aagagtctta atgaaatttt tttttgatat 3240 
ttgcgttttt gatttctgcc ttttgatcgg aaacacattt ccattcaaat tttacacaca 3300 
ttttccagat acatcgaagc gtgtttgtcg agcatgtcga tttgaaaaat gtgtacaagc 3360 
gggaatgaat ccaatggcaa ttcaagcaga agcaaagact gatgaaggag aagaacttaa 3420 
gaaattgatc gctaaaaagt ttgaaaacgg agaaaaatta aatgatggaa ccgtattttt 3480 
taatgtacat gacagattga atcaaatact tggaaagttg ataaaaattg aaacaaaatt 3540 
ggaaaaagtt catgataatg gaatgccaat gggatttttg gatcaaagag atttaagcac 3600 
tgcgctctct tcaaaagtta tctataacaa tatggaaata ccgtcaatga gctatactcc 3660 
tgttaaaatt tcaaaaaaca ccggattgtg agtttttttc aaaagaaaat tccaaaaatt 3720 
tcaaaaaatt ctgaagtaat gtttaaatcg ttttatgagc attgaagctt gcctgattgg 3780 
ccaaaaataa acaaaaagct ccaggcaact gtaaaaataa ataaataaag attataattg 3840 
acaaacagta tttttacaaa ttgttgaaaa ataagttctg tgcgttttca ggccaaaacg 3900 
aaggagtcga aactttgttc attcaagttg tcttgcatct atcgaatatt ccaaaacatt 3960 
tgacttttca agtgcaattg acatttcaag taaagtacgt ttttggaaca gattttatct 4020 
gaacaatgac cttattcgat tatcttttga gattatagac tagcaccatt tttgcaagta 4080 
ttcaaaatta atattagact tatttcagat aattctcctt aaaaatactg cactatcttg 4140 
tgcaaacttg acaaatgctt atacaacatt cagaaagctg aaatcagata cacttttata 4200 
cccagatggt agtatttatg ggccaccacg acggaaaaat ggtcctttga ttgaaaaaca 4 260 
acgatcattt ttacaaaata cgttgatatc atttatgact aacaatgttg ataaaactga 4320 
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atatatttta ctaaaagcaa tagtactatg taatccaggt atgtcataag taccaaataa 4 380 

ttaattttca aatraaatatt acagcaataa tcgatctacc ttatgccgat tcaaaacata 44.40 

ttcaaagaga- gcgagaggtt tacgctcagt gtttgttccg atattgtcta ctacaacatg 4 500 

gtacattaca cgggcctgca aggttttctg ctttattatc aatattcaat gtgttggaaa 4 56"0" 

atcaacagaa ggaacaaaag gattactatt: tatacateaa' acttat&ca,t agtcaaaaac 4 620 

acaaggdtcc tgaggtttta aagaagaaat gtattagtgt aatttatgat cagattatgg 4680 ' 

attagaaaaa gaaatctaat atgatagtga ataaatagat. tg'atgaataa aaataatttt 4 74 0 

gaaatgtaca tgtttacatt tttatttgag tacaatttgt ttcttgaaac catattaacc 4 800 

atgaagtcca tcaattttat cttacaaaat aaaccactcc gacttgaaaa ttaagtgtgg 4 860 

aatttgccag tacttcagct tcctttaaaa aaaaattcaa aatgtttaat tcttttccca 4 920 

attttctatg cggtatttca gaacttcaaa ctattgggaa cattttgaga caacccccac 4 9-80 

agcttaaaag ttttcaaaat tfgtttctccg ataaaatgca catatapcta acttttgaat 5040 

ataacccatt ttctctctga gttttatcgt gatttctg'tg acgaaacact gaatgcgtac 5100 

tgcttttatc ccgatttttc attgatctcc attaaaaaca tcacagtttt gttgttatta ^5160 

tctggtttac aaagagtcat attcagctgt acatacaagg gatctaatcg caacacatta 5220 

tgtgcacacg agctttgtat cgca'aacaat acatcgcggc- taaaaatttt cgttttctct 5280 

,actttctcat. tcatttatac ttcatgcaac gtgtgtagat' gagggaagtc agacgcagac 5340 
g^tacgtaga ctccaaagaa aeaaagatgc gttgatcggt gcagacgdca gtcctctctc 5400 
taccgttcta tggt'ttctgc gattttatta ttccaccttt tgtgaagttc gatcacttta 54 60 ; 
cttttcttgc ttcaatatca gaaagatgct tcctgacttt attacagtac catcaacaag 5520.. 
tgaaaagtat gtttaatctc cagaattgtt ttgtttgcat agatgatggg aaatcaataa 5580 
ccttctgttc atgtgacgcg cggtatcata tttctcttct tttttcagat gtctttctcc 5640 
ggaattgttg ctaccaaata tggatgagaa cttggaagat tcaaaaccaa gcgtgagttt 5700 
ttcctattcg gaaaaatata ctttctaaaa aaatatt.taa aaacaatcaa aaagttaaag 5760 
tattaattat atatctcaaa acgatatatc tcaaaaacgt tatattcgaa tttcagtctt 5820 
taaaatatca aaaggcgatg atgagccgag gaacatgtcc atcaaattgt aaagtctgta 5880 
ggcactcagc tactggatat cactatgatg ttccatcttg taatcfgatgc aaaacatttt 5940 
ttcgacgatc aattctggat ggaagaaaat atacatgcct gaagatgaga aaatgtttga 6000 
gtggaactga acctgttggt atgctttaat tttctcacta attcaagatt tcactatccc 6060 
tatcatatac ttcaaaccat ttcagacttg tctaggcgca tgtgtcgagc atgccgattt 6120 
gagaaatgtg tagaagctgg aatgaatcca tctgcaatac aagccgacat gaaaaccact 6180 
gacggtgagc tgttgagaaa agaaattatg ataaagcaga agacagccgt tgattttttg 6240 
aatactccac aggtatatta caaaaataat tgcaacaatg ataatcactg attatttcag 6300 
gttattatga gttttgaaga taaagtgcaa ggaataattg gcaagctgac agtaatggaa 6360 
ttgaagatag aaccattgta tacaggagga ttgcctccag gcaatagagt atgatgcttt 6420 
tcctgtaaac tggtaacttt ataattcagg atatcaggaa attggacgag ttgattgatg 6480 
ctccactgat tctcagctat gacgaaattc caaatttaaa gtattgtccg agtgtagacg 6540 
aattgaccgg agagtgagtt tttattcaga aaaatcaatgf attccataaa attatttttc 6600 
agaataaagc ccagtggagc ttgctacata cattgcggct acctagctag tatagaatat 6660 
tcaaaaatgt ttgactttgc tcataaaata gatgttgcta gcaaagcaac tctgataaaa 6720 
catgcaacaa ttatgtgcgc agatatcatg acagctttct tctcttacta tcaacgtaaa 6780 
tccgatcgac tgattcatcc aaatggaatg tttgcaggac caccaaaata caggtgaaac 6840 

' acttcagtaa atatgttttt gaaaaatgat tttgcttgag atatggtgaa gccggtacaa 6900 
aatatcaagc ttcaatgcaa agaactctgg caacagtgct ccgacatgaa cttaatcgaa 6960 
ttgaatatat gctactcaaa gctattgtgc tttgcaatcc aggtatgacc aaaaattttg 7020 
aatcaaaaac acaaaaacat caatttcaat acaggagtcc atgaaaataa tatcttttaa 7080 
taacacgatt tgatttccag cagtctctag tctttctatt tcggtacaac aaataattgg 7140 

. aaaagaacgt gaagaatatg ttcgaactct acttacatac tgtcttctca attatggttc 7200 
tgttcatgga ccatcaagat tttctgccct tcttgctatt atgtcagttc ttgaaagtca 7260 
acagaagaat gcaaaggact ttcatttatt ggctaaagcg acaatactga aagatgcagt 7320 
tagatacaca cgaattagca atctttacga acaaatcatg gagtcgtaaa gattagaaat 7380. 
tgttgaaatt aggatatttg ggtactcgaa gtctttattt ttattacttt caatttatgt 7440 
tatatttcta tttatatata catttgagaa atacaaagaa atcagagaaa atcatcagaa 7500 
ttgttgcatt ggttctagcg ggtattgtaa ataaaatgtt atttttccgt ttcgaatctt 7560 
ctctacataa agcttttcga tcagcacata aagtgagaat agaagtttgg attaaattac 7620 
aacctgaatc cgttatccgc aatcagttta accgcatcgt aaagacaatc agcagtgaga 7 680 
tcaggagtca cgggcatctt ttcgaaatct ggtctccatt taccagttct cacttgaaca 7740 
cctcgcattc cacaagcttg agcaccgcct acgtccgaca tcaaatcgtc gccaaccatt 7800 
acaatctgga aattagaaaa tgaagtcaca aagattttgt tccaacataa gaatggaaat 7860 * 
tgaaattccg atgtataatt gttttactta cttcttctgg tttcataccc aatgcattca 7920 
ttccttgctc gaaatagaac cggcttggtt ttccaatgtt caaaacctca caattagttg 7980 
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agaacttgag- agcagcagca aaagcgccga catcaatgca cggaccatca actcgttgga 8040 
agaactttct aaaatagaaa agggaaatgt- gtta,gagaga acaggaaaea attattaccc 8100* 
attgcccatt gttatgagaa gcggtttagg catgtcgatg a gaattcgga aagct;c.tgtt 8160 
gattcgatcg aatgagaaap cttc^ttcaac ttcacccatc ^caacacaat tcggacjatga , 8220 
ggtgtcgatt ccatcgaagt attcaagcac atcttgagag gaaaaagaac tgatfcgttgt 8280 
cagagaatta •aagaatcaaa taccgtctct gacaaacaga tgtggcjcgaa gcttgttctc 8 34 0 
tcgacaatat tgtgcaacca ctggagctgg agttat.gacg tcttCctctc gaacactgaa 8 4 00 
aataaaaact attataattc caataagtag tgggtgattt gaaagctgcc tatttgacat 8460 
tattaatttt tacattattc ggtaacactt tttttctaaa tttgttctat cagacttcag 8520 
taggacattt tggtggtttt tgaattgatg actgatcaac agaaatattt tctgaatctc 8580 ■ 
aaaaaatcaa tggctgggag agtgaaattt tttttctgat, ttcttccatc ggaaaacatg 864 0 
caaaaattgg gtaaaatttg attt^ctacaa aaaccgtgaa •ataactctca aacttacttg 8700 
attccgagtc tttgaagtct ccgagcaaca tttctgtttg aatttccttt tgcattgcta 8760 
agaaatttta ccttcgagtg ttgatacaaa ctgtaaatat tttcaatatt atgtatctat 8820 
ttttgcattt ,tacaatattt ttcttatcta atgcaccgtt gttctaggtg: cacatgcacg 8880 
caaatgttta ccttgagggc attttbtata aacagttttc caaaaatttg tggaaatttt 8940 
ttcggattga gaacaaaata caggacatgt aaaacgattt gacataaaag tgataaactt 9000 
gaaaaatga,g gatacatgat ttgtatctac catttttatt aacaaaaaag agttgaagtt 9060 
tttattagct agaaaaatca ttaaaaaact tadaaattaa ccgcttcagc tgattttgga 9120. 
acagcaactc catcggattt ataaatacta ttatacaaaa ctcccgtt at atcgagcagg 9180 
aacccattaa ccgctcttcc atttgacatt cttcgaccct aaagtttcaa 1 tatgaaattt 9240 
cataatcctg aaaccaagac caaagttcag gaaacicagca aaagctataa catagaacaa 9300 
aaacagaaaa gatagaatga aaatctjtgag atagctatgg aaaag'gaaga gaggaaaggg 9360 
tctttctctc tttctttcca tgcgggggca atatgggcag aaagggtcjcg .tttagatgtt 9420- 
cttctctctg ctcactgcat gtgcacccag tacgtcttct tcacttctct caccctcttc 94 80 
actcctttct tgttccacta tctcttcaat acacatgtcc agc.tgcctat tggaccttct 954 0 
gttttcaagg tgttgtcaca acttttgagc tatgaaacta ctttgatgct aattagatat 9600 
gatttagtaa tatgtatttt gtttcatagc tcacacagaa aatgtatatc atgagaaaca 9660 
tcagagagat ttagcaatgg atccgtaatg tatctttaat gaactcaaga tattcatatt 9720 
ttaaaacgtg tttacactca aaataaatga atattgtaag aaaacaetga atcgcatacc 9780 
caaaagcctc caaactttca aaagatgtag agtaaccata caaaacaata ggaagagttc 9840 
gaaatgtgca gtcacgcgcg catttcttgt gtctptaagg tgtgaattac gcaaaGacag 9900 
agacggcaga ttgcggtatg tcggtcgaga atgatgaatt acgatcaaga caagaaaaga 9960 
agaagaagaa ggaaagacga accagagatt ttcgactgca gtagctgagg aacaattgta 10020 
gaagaatact cggcgtcact ttataaataa atgaaacaaa atcttgatta aaagagaagc 10080 
tagaaggtca cgttaatact atgaggttat gcacttgaaa ttatgcaata tttcaaatgg 1014 0 
cacagaaaaa accaacaggt aataaaatca atcaagtctg aaaatgttgc tctagaaatg 10200 
tgtatcgtac aatgaagcgt gtatctctga atcaa'aaacg tttagactgt gacaattctt 10260 
tccggaatcc aataagaaca aagtgtggaa aagggatgca ccgag'ctgaa aatttattga 10320 
actgtagttt gaggaactca atcaagacga accaaacatc attcatgcga agaaacatca 10380 
tttaatttta gaaactataa aaaaaaggtt aaaacattta aptggcattc agagacaatt 10440 
cttaaaactt tgatggcgca aaacaacaag acgtcatcca cagaagaaac tgtgccctac 10500 
atcttcaaat agatcggaca tcatctatct acaagtattc tttctttttc attctcttgt 10560 
ccctctccat cagccctcct tgatagtctt ttttcacatt ctcaccttct tttttgctgt 10620 
ctatttactt ttatgattgc cgcaccaata aatatctcta ctcactttca aactccttat 10680 
cgatttaatc catatgtact atacttttaa agacagatta tgcgtgaggc gaatgtgaca 10740 
tagtctagaa ttccggaaat tgtgcgcaaa attcttagaa acatatgtag ttgagtcgca 10800 
tttttcgaat ccagttttta tttcagcggc tgttttattt cagcggctgt tttatgactg 10860 
catttttaca agctgctaat tgcgaatttc cacgataacg gtttggttgg cacgtgtttc 10920 
ccaataaatt tgtaagcatt ttacataatg caccaatttt tccctccaca catcatttcc 10980 
gaatttgatt tttttttctt aaattacttg aacatattat cgaaaaaaat tgaaactgtt 11040 
cccatatcta caaaacatca ttcaatagta ttagattcag aaaactgcct acaaattgat 11100 
tttttaacca ttggttttca tgaactatct ttcaacaatt cagatgaact ttttttaaaa 11160 
ttcaaagtgc tattatgtta ttacaattag aagaaattaa aaattcaaat tatacaattt. 11220 
. aagtttggct ctatcacggg ttttaaatga atttcttgaa aacgcgcggc aaaagcttag 11280 
caaactgtct attgaactta cccttactat taatcattga aattttcaaa tgcgtatatt 11340 
taaaaaaaca gaacggtatg gattctcatg ataagacagt tccttaatat ctcgtagcaa 11400 
aaaaatcgac ccatctcttt gccccttgct tcatttctgc cgctttcttc gtgccaggaa 114 60 
ggtaggcatt ctcgtgaacc gacggtctga gtttgagtgt tttcctctta aaatgttcga 11520 
ttattatagg tcgccagaca taaatattat ttgttaatat ttcatggtgt cgaaaccaca 11580 
atcttttttc tttctacagg tggtttaata ttcgtttcct ttgtaaattt tcccatcggc 11640 

-108- 



02055701 A2 I > 



WO 02/055701 ' PCT/US01/49060 

• . ■ 

aggaattggc ataacctgat tttttacagt agaaacacat atctagaaga ttaagatctt 11700 
caattgtcca tctcgtttta aactgtcatt tctccggttq tttttggttt ttggtggcct 11760 
tttaccggtt ttgttacgtt ttcccgaaat gatataaatt tttgagcttg tcacttaatt 1 1820 
tatcaagcgg cgaccaaaga caaatacaca ccgcagaaga aaagtttgta aagagaaatg lffffftT 
gaagttgact atatgtacat atataaaatg gaatgaatag tcgcatbaca ttgacggtga 11940, 
aagtacatgc ggtgatgcct tcgaggacat acggacacac tttccttctc caatttcatc 12000 
gattaatttg tattgttata ttctgaattc caggttttca agacgaacat tgtgaatcat 12060 
tttcatcaca ttgttctggt acaaataaaa tgtatttcca agttttcccg. cacatttaca 12120 
tttttgcttc actacgtgtt tagcacgtta tgcgataaga ggagaagtgg ctgtaaaatg 12180 
gtttcgcact cccttttttc tctcccacaa tcattgcacg gtttctatga ccttttcgaa 12240 
ctacattcct tcttttttca tttggaggtt tgatadtgat atagtttgta gggtacccgc 12300 
ttattgtaac tctaactgtt atatccattg tcattga^tc atttat^ctgg ggggtattcc 123*60 
cttctatttg gcgaagtcta catttctcgt aaactgtaca aaataacgtt taaaagttct 124 20 
- ttgaatactg aaaataataa aacggtttga acaatccaac ttattttcca attttcaaaa , 12480 
actagccgat catttttgtg aaat^aaaact attttctgct tagtatcgaa taaaacatct 12540 
taacggtaaa aatgagtttt ttaaaacact tcttttaaac • cttttttgga tttjatttgaa 12600 
ittttggcttt gaaatagtct ttattccgga aatttcatat aatctcaagg gcaaatacat 12660 
tt'tgaaccgt gttcaaagtt atttaaaaat gcatttcgtt tgaatcactg agatttccaa 12720 
aaattcaatg agtcataata catggagcaa caaatctata agaatgtcca cttgcttttg 12780 
ttgttctttt gttttcaaac gttttggata taggctgtta ttttggccga caacaacttc 12840 
» agaatgtatt ttatgtttga gttttaaatg tgttactgaa aacaaataaa atgacagatt 12900 
aatatatctc ttggtaaaaa ttaattaatt ttgttactac tttctgttcc tgaaaagcct 12960. 
gaaaaatcag accatttttt gcagtgtctt tgttaatgat aaaaaccgac atgcacactg 13020 
accatcaaac cgattattat gaattaataa tttaatccga cagtttcctg tttctgttca 13080 
tttcatctca cgaatgcttc aattttcatt ttttttttcc gtgtctctta atacgtttaa 13140 
tttattacga gttcaagtaa caacgtttca atgaactctc atttagtttg aaaattaaaa 13200 
tatttaagtt ctaacatttt gaagagtgaa cgatggtgag aaacaaagtg gcaccagttg 13260 
aagatggagc caatattcaa agaaacttcg agcctccgcc accatataca actccaacgg 13320 
attcacctga agacaagatt cgatcaaatt caactgcaac aaccgcatct caaccagaat 13380 
ttcaaggatg ttggactatt gttgtagttg ccattttatt catcatcaat cttctcaatt 13440 
atatggatcg gtatacgatt gcaggttagt tgacagtaac tattttacct ccttggtaca 13500 
atattcaaaa cacaattcat gtaaccatgt gacagttttc cgtgcaataa aatttagtag 13560. 
agcctccaaa actataaaca acaaaaggta gaaagatgaa aatggccgga aaacgacatg 13620 
agagagtatc ttgtttgaac ttgtcatcat tagaagataa gaatttacgt aatgggtgag 13680 
tacaataact atgggacaaa catcattttt tgatctaaac ggagcattaa aaaatatgtg 13740 
catgtcaaat ttgaattagt atgtgtggaa cttttcctac tgcaataata taacgttttc 13800 
.aacaaaaaaa acattaaaaa cattattcgt agaggaaatt agcatttaaa aaccaataaa 13860 
atttttagga gtccttaatg atgtacaaac ctactacaat attagtgatg cttgggctgg 13920 
ac'tgattcaa acaacgttca tggtgttctt cattattttc tctccaattt gcggtttcct 13980 
aggagatcgt tacaatcgaa aatggatttt tgttgttgga attgcgattt gggtgtctgc 14040 
agtgtttgct tccactttta ttccatctaa tgtgagtttt t<gactctaat catactattt 14100 
cttcaatcaa cttcattttc agcaattctg gttgttcctt ttattccgcg gaattgttgg 14160 
aatcggagaa gcatcttatg caattatctc cccaactgtt attgctgata tgttcactgg 14220 
agtgctccgt tccagaatgc ttatggtctt ctactttgcg attccctttg gatgtggtct 14280 
gggattt^ta gttggatctg ctgtggctag ttggacagga cattggcaat ggggagtacg 14340 
. agtgactgga gttctcggaa tcgtttgtct cttgcttatt atcgtgtttg ttagagaacc 14400 
agaacgtgga aaagctgaac gtgaaaaggg agaaattgct gcatctacag aagcaacaag 14 4 60 
ttatttggat gatatgaagg atttgctttc aaagtgagtt tctttcaatt ttatttaaaa 14520 
agcaaccttt catatttcag tgcaacatat gtgacaagca gtctcggtta tactgcaact 14580 
'gttttcatgg tcggaactct cgcttggtgg gctccaatta caattcaata tgcagattct 14 64 0 
gctagaagga acggaacaat tacagaagat cagaaagcaa acattaactt ggtattcggt 14700 
gcactcactt gtgttggtgg cgttcttggt gttgctattg gtacattggt ttcaaatgta 147 60 
aatattgtcg ttgaagatta gaaaaatata catctttttt ttagatgtgg tctcgtggag 14820 
ttggtccttt taagcacatt caaacagttc gtgcagatgc tcttgtctgt gctatcggag 14 880 
cagcaatttg cattccaaca ctcattcttg ctattcaaaa cattgagagc aacatgaatt 14940 
ttgcttgggt gtgtaaaatt tggactatga aacatttcaa attcgctttt ttttagggaa 15000 
tgttatttat ttgcattgtc gccagtagtt ttaactgggc tacaaatgtc gatctgttac 15060 
tggtaagttt cactcgatat tgaaacaaat tcccaaataa atgttttttg cagagtgtgg 15120 
tagttcctca gagaagaagt tcagcctcat catggcaaat tttaatttct catatgtttg 15180 
gagatgcttc tggtccttac attctgggat tgatatcaga tgcaatcaga ggcaatgaag 152 4 0 
atacggcaca ggctcattat aaatcattgg tcacctcatt ttggttgtgt gttggaactt 15300 
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tggtgttgtc agttattcta tttggaatat cagcaataac agttgtaaaa gataaagcca 15360 
gattcaacga aattatgtgt aagtttcaga ctttttatag cttggcactg aatcaaaata 15420 
t tcatttgca gtggcacaag ctaacaagga caacacatca agcgggacac ttccaattga 1 5480 
agacagaaac acagaagacg aaactggtt9 cgaagttcaa catatgtaat tctaatattt 1554 0 
tagtaaagct ggttgctcaa cccctcttcc cctacatatt acattt'ttot cacagtttat 15600, 
tgcgggtctg ctcttttttt tgtatttttg tatcccccat cagtcaatca ttcgcataaa 15660 
ttttgcgttt ttattgccaa atcttttgta catttcggaa tattcgtgct gtcagagaaa 15720 
atcaatgata gcagttaacg tgattcttta tgcttttcat ag.tgtttttg aaaaatatga 15780 
aaacaagttt ctacttggaa cttttttatg gagcctcctt gttgtgtggc ggacaaatga 1584 0 
caacagaagc tgggtaaaaa gtaagtttta ttcaaaacat. actatacttc gccttcccac 15900 
ctctcgcccg gaaatgtttt cggaaaagta tagtaaaggt cagttgaagt gttaacatgg 15960 
taacaaaaat ggcatacaaa actaccggtc aggcgcg^gg aggcg^ataa agaggcgagd 16020 
atgaatcacc aatgtggaac gccgttattt tggtttattg tagaaatatt actttcgaac 16080 
aaaatcgtga cgccaaacaa aaataacttt tgcgttagct tgcagtttca atttttaaaa, 16140 
ttttccatgg ttgtccaaat ttc^gcatag caaagtactt gtcacgtttt tcaacataag 16200 
aacagttttc agaaaccaac tttgattcat ttttcattta' cataatttca gatacaattg 16260 
• aaatctgca.a tattgtaaga ttcatatcag ttccattatt tcattcatct ccagttaatt 16320 
ggctttattt atttactaca tcttcagcat tttcttttat tgctcaaagt tggtttctaa 16380 
aaaatcaaaa gtttttaaac tgaccaattc agaaaaagaa tttactggat tcacattaat 16440 
gatcgctctt catttgattc aatattttct catcacagga ttaagtgcaa tgacatattg 16500 
• gaatttagga aaacaattat ttaatttatt tgtttggcca gatgattctc gtcaactaat 16560 
cattcagttg gcaatgattt tgtctactgc aactatgttc tgtcacgtat ttatcagtat 16620 
ttggtcaaag ctttcagaga ttatcaatat aagtttgttt tttttacatt gtttctctga 16680 
aacagtatta tatattttgc cttcagattg gtttctagtt tctacca-tcg ttcatcctgc 16740 
tattagtttc tacatctttc cgagatttca acatgttcat atgttgactg tttggttttc 16800 
ttttcttatg atttcagtta aatttaatca gaagatgttc attaaagttc tcggagtctt 16860 
tttcattgtt ctatactcca gccttttgga atcctgggaa ttatattttt tggatttgaa 16920 
gtcacttttc aaaataaaat atttattcta acaagtgttt acgtagtaag cattaactta 16980 
aatatggtga aatggaaaat atggtgctta aaaaacatca aaatgaacac agactaccga 17040 
atttgataat gacacttcgg taatccttct caaaaatttt tattactgtt ttaaatgtta 17100 
gaataaaagg ctcaaaaaca aacctccaac tattagcagc cggaagaagt aattaaattg 17160 
aaacaaattt agggccgctg caagcctttt ctttataaat tttgaacaat tataaattca 17220 
gtctttgaaa gccaatttga gttttattaa gcagtatata tttttagata gtcggcattt 17280 
gaatctatca acaattttaa tcatcatcat catgtttatc atcaaeagta cctccactgt 17340 
ttcgtacaat atcatcatat cgtaaataag ttcttctgtc agcatttgat ggacctggac 17400 
gaacacaatc cgcatatcga agacgatgaa atcttgtagc ataggttgga ggatctctga 174 60 
aattgttcca attttcttcc agtgttattc cgaaagaatc gtaccggtaa tgtaaaatcc 17520 
atttctcaaa aattccttca aaacaagctt cagtgtcaga atttagaaga agtatctgaa 17580 
tcaaacaaac ctaatcagat gaaactttaa acttcatacc caattaacac accaagattg 17640 
gcttaaactg aatccaggcc accagaatag tcccatattg acaatatcgg attgtttaaa 17700 
gaaaatcttg aaatgatcac gagatccttg ttcaaaatcg t.ttcttcctg ggttatcgag 17760 
atgaaaagtt tcagaatcat gtccagattc atagaatact ttcaaatatg atgcggcgtc 17 820 
tgttccagca ccatctttgt cacaagtctc tatgactaca tcccagagac gaccgtcatc 17 880 
tacaaaactg aataattgcc acttatttga atttgaatac aacaattttc agaaaagttt 17940 
tattatgatt gttgttcatt tcagcattgt atgcgtgatt ttaaacgatt ttcttgattg 18000 
acacattagt tgaactcaaa gttaagatta aaaataatat taaactacat cattgt.tcac 18060 
aacttcctgg aaagttttag tgaggttcat attcgaatat tctcttcaac cccattttta 18120 
aaagttgact tgtgttttaa ttgtttaccc tttctgatca tttctatcat tgaaattgga 18180 
agtcatttca cttttcttcc cacagctcat agttacaaaa acaagagtga ctgaaatttc 18240 
'aaaaactact tgtggtttca gtttttattt tattttatga atccagttag tcgaaaaaaa 18300 
aacaatcaaa ttaattccat gcaacccaaa atattttaaa gttcttggta tttatgcagt 18360 
ttttaaaact gagaatcaaa agttaaaata attgttttaa acttacgatg aacaacagtt 18420 
tcattagttg gagaaaattg tgcaaaaacg atagtaacta gtaagcagta tattaaaaat 18480 
gagtggaaca ttttggggag ttggtgaaaa aattatagga agagatgaaa gacactttgt 18540 
gttctggaat cttctcctga cgacaactaa aaatccatca gcaattgaaa tgatcgattg 18 600 
atgattgatg gatgagtgtg atcatatgct tgctcaaaat actttcaaag cgaaaaaaaa 18660 
acacaaggga aacatgagtt ctaaattttt gatcggaaca gatggtagaa attgaaatgt 18720 
ttggattcgt catcggtaaa atgaatataa acatttatta gtgcaaataa acaatcaaaa 18780 
tttaatatta aacaagatga aaaatatgac ctgatttttt gaaattattc tggttgtgaa 18840 
aaaatatttg tatgattata caaatcgaaa tagttcaaac tctcaaaatg atgtctctcc 18900 
ggaatctcgt gtattttcac gagtttcagt tctttcaaaa aacgtgccaa atattttctt 18960 
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ataaaacaaa tagtctttga ctaaatcaaa gattaccgtg atatgagatc tt'aaaagtag 19020 
cattggttat aagctattaa ta&gattaat acgagatact gtaattgtag, atttag^tta 19080 
qcttcatata tctctaattt aactttttaa gcctcgagtt aagtatttta catttgaaca 19140 
attgtgtaat caaaacgcaa tcctptacat aattattttc agcgctgtat tttctcltcat 19200 
caatcaaaaa atttaatttt taaaaataat ttgatttacc gagcatctga acagaagcaa 19260 
ggacacgtga gtttgctgat tttgctgata agaaatcaat tatttaggtg cattggtga.t 19320 
cttcattctt tttctaaact tatcagaaaa aatatattct ctat'ttttaa catttattgt 19380 
gttctacatg ttctacatgg ttttgtatat tcgtaacaat tttttaagta gaaaaaaata 19440 
tatactgata atcagcaaaa actagttttt gttcacattt tgatatcatc acttttttat 19500 
tatattcaac tagagtttaa aatattaatt ttattgattc ttat;cgatat ttttgactat 19560 
tgattcccgt gttgatcgct t.tgtgataat aaaatttttg, gaacaaatta gccaatcgtg 19620 
cattttttgt catcaattca atat|:atctg atgattcctg ttcctgttag ttgcaaaatc 19680 ( 
atgaacaagt atcacttttc ctatatttta tttcatattt ttcccgtcat gaaacgttat 19740 1 
gttttgcctc atataaatac aaaaatgaac aaaaaagtgg tgttcgactt gaaaacaaga 19800 
tttttaggac agaaggtcgg atgggaaaag gttttcgttc gacggacatc . acacaaaaaa 19860 
gtatcacacc gcacctca^t ctatttactt tgttgataga atggaaaaac attcgactga 19920 
aaatgaatac agagtaagtt tatttttcca aaaatgtttg aaataatgat tcaattttta 19980 
ggtgtataag gaacgatggt ttattctttt ggcgactact gcattgattt gctcaaatat 2004 0 
tatggtgagt aatggaatat tggtattttc atatttagat tggcgtgaat tttgctaaga 2010a 
agtccgatgt ttttccaaaa ttcaaataca ttacaaagta gtgccaaaat taaacatttt 20160 
ttagattttt tgttttagct gaaaagtttg acgaggtgtt cataaatttat aagagttttt 20220 
gaaaacaaac ctttaattat gatgttcctg cattfcttac aatttttgag gtaatttaat 20280 
agaagacgag cctgccacaa aacggtttta ccagattttt caaactttaa caattattta 20340 
ctggatttaa aataaaacat atbaatgttt tttctgcagc aatgggtgag tttctcggct 20400 
caaatcgacc aaacaaatat gtttttctgt ggcccaaatg aatataagaa ttgttca^ca 204 60 
gccttcctat ctaatcaaat ctatcaggtt agatttttca attttgatcg gtataattta 20520 
ttatattatt tttagatagt tgcagtaatt gtgagcatta taggaatgta ttttgcaaca 20580 
gtttttggaa cacttccaac tgtatgtttc ttcttctaat ttacagtttt ctaatttaca 20640 
cgtcttctag ctccgtctct cagcagtttt . caatattgtt ggtgcagcaa tccgactcat 20700 
tgcgtcactt ccaagccttc aaaacttttt ctggagacag tttttaatga ttattggaac 20760 
tagcattgcc gcggctgctc aaatgtattt cgtcgtcttc tcgaaaattg cggaaagttg 20820 
gttttatcca agacatcgag cttctgcaaa tgttgcatgt tcaaatagtc ttgagt.tagg 20880 
agttgtactt ggaactgttt taccatcaat tattgtacca gcttctttta caaaagaciat 20940 
tgtcagttcc tggacatttt ttcttttggt tagttatgtc tctgtcatga ttaaacaact 21000 
atattacaat ttcagaattc aataatagca gttgtatgta taattccatt atttttatta 21060 
tttgttctat gccgtcgatc ggttccaaaa acgccaccat cagcctcatc gcaacatgag 21120 
tcaagtggtc ctgtgacatc tggtattttt aaatgtttaa agtaagcccg tacttagctg 21180 
tttaatttta attcttctga tagttctcag ggatc§tcag tttptcattc aaatattcgt 21240 
ttattcaatt aattttgcaa ttgcaaatgg cctcatttat acttctaatg ccatcaacta 21300 
tagaggttat aatttgaaag ggtacgtcta gtaaggtttt ctgctaggta caagaaatcc 21360 
tactctgtca agtaggcagg catttttaga ' cctgcctagg aggtttaaac tcaagacagc 21420 
cctttgcgta taattatttt gttcaggtat ccaatagcaa tcgccacgat ggtctgcatg 21480 
ttttcagcct atttcgtagg agtaattgca gatagaacaa gaaagttcaa agttagtttc 21540 
aaaaaatcta atcttatctg atatccattt ttctagttga tcgcaataat aaacgcactt 21600 
gttgtagccg tttgtgtact ggtacttcgt otggtaaggc tttttcaaaa taacattaaa 21660 
tttattcatt aatgctttta gtatttaata aaaacttata cgggatggta tgattcagta 21720 
attgtttgta cacttttaag cattatcagt aagttatttt aattttgctt atttagaata 21780 
tattttcaaa aattcagtgt cctgttgtgc tatacataca ccgattggaa atgfagatggg 2184 0 
agttgagaca acttacccag ttcaagaatc aatttctaca ggagttttga atactttcgg 21900 
gtgagccttt ggttcttttt acggcgaaaa aaagtgtgat atacccgctt ttttgggtct 21960 
cgcagcgaat ttgtatttcg gtttgtgctt ttctgatatt aattttatac gagttttatt 22020 
caaaacgaaa cgttaaaaag tttttcagtc aagcttggtt attctgctta tattttatta 22080 
tgtattcttt acaagaatca aattgggttt acaagaacaa ccgtaaagga ggaagctggg 2214 0 
aatgtaagtt tctttttaaa ttttaaaaca atgtttcaac taacgcggtc acttacaagc 22200 
ctgtgtagct ttagtttcta actaattttc aaataattaa aagttttttt tctaatagta 22260 
aataaaaata taaatattac gggaacacta aatactgaaa gtgcgtattg cacaacatat 22320 
ttgacgcgaa aaatgatttg tagcgtagcg aaaactacag caacttttta gatgactacc 22380 
gtagaactgg tttacggaac tcgtgtttca atcgataaaa tattaaaaga aaacacaaaa 22440 
atgacaaaac aataaaaaaa aaactttaaa aaaaattgaa tattgttgtc acaattcgca 22500 
caaatacttg tttcgtaact agagccccac ggcccgagaa gtggtacctg tacgcaattt 22560 
gtctaccgta cacctggacg tttgggcgcg tttttctcaa aaacggctgg tccagttttt 22620 
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ttgtgatgca ■ tataaaaaat- gttcgaaatt aaattcaaaa ttttttggac cdaagctttt 22680 
tccttaaaac gagcccaaap ct^gctaaac tgcaattatc aatagagcgc, gtttacaetg 22740 
atgtaccctt- tgcegggctg tgagcccgta aaccgacacc agcactaagt acagaagt ca 22800 
ttaaaataat tgttttaatg ttcg ( ctacga tatatttccg ggtcaaatat gttgc^caat 
acgcattttc aaaacgtagt gttcctgtta attatgtttt catttctgct tcaaaaacta 22920 
aatattttat attacagtag cccttgactt ctgggcagga atgtctattc taaactttat 22980 
tgtggctgta atattcttga gaccccgata taaccgttta caasttggaag aagaagctca 23040 
aaatacacgg gaagcaattc gagattcagt gtatacaatt tgtaagtaat gaactgcttt 23100 
tatttatcaa tgatgtaaat aataaaaaaa tgaatttgct gaaaaaaata aagtcatgag 23160 
aaacaggaaa tgcgatatga gacaatggtg aaaaaagatt gggactagtt gaaactgggt 23220 
gcaaatgaac tggtgaagtc ^agaaacata atataaacat ( gtttgttggg caaaataatt 23280 
tcaaattgag actatggtac atatpatttt ttaagtgaga eaatacaatt cctctaactt 23340 
cttaagtaca aaatgctata aaatgcaaag taaatgcttg ctactatcgc atttcgatat 23400 ■ 
ccgacgcagc cgttagagct gtaacaggtt cctcattttc taatgccaat ttggcatttt 234 60 
ccagttctag | acgacgata,t ctgtaattga taaaattatt tattgagttt. tgataaacgt 23520 
ttataccgtg gtctcaaaaa gaacagtgat aagatgacag atacaattga gagcccacat 23580 
gttacatcga gagcaadtga aaaatattca tttacttcta ttttgaagat gataaaactt 23640 
acatttcca^ gattgtgcat agaacaagtg caacttttct gtgtaatttt g/tgcaaatgg 23700 
aataatgaac atgaaaagcg atccgaatat aacaagaaca cctgatgacg ttgcttccat 237 6Q 
tactgggaat gtcgtctcaa caccaagttc cactccaatc gggaattgag gtattgaaaa 23820 . 
ggcacctgaa gaaatatagg aattttgtta tacaattaaa tttaaaaaati ccttaccgag 23880 
acatccacaa agagtgtaaa cgattatgct atcaa'agagc ccagtccgtg gttgatttag 23940 
gaactgcaag tgtagccttt ttagattaaa aaaatccata gaggaaattt agacatattg 24000 
acacaatttt tttcgaagac ccbggtagaa gaattttaac tcaccattct caatgtaatg 24060 
actgaacatg aaaatccaac agtacaaact ctgattattt cctaaatttt aaaattatat 24120 
tttctttttc tgattcatac cgaccactca ccttgaactt gcgtgttttg tcggcaatat 24180 
gaccagcgag cagtgatgtt aacgtgccaa cgattgcaca cacggcagtt ggatatctga 24240 
aaaataaatg aattactcat atgcttctag aactctgcaa agttgttacc cagccatctc 24300 
ataaccttga tctttgagtg gaccatcgag gaaaatcatt agactccaca gaagcgaaaa 24360 , 
cgcaaatgca aagagagtca tttggatgaa aaattgagca ttactgaaat taaagtattt 24420 
aaagtttttg gcgtgatact aaaaacctac aaaatacatt gaagaataga tttaaagaaa 24480 
ccaatgttgt tctgatgagc agcagaagaa gctgatggt'g gagtgggtgg cagtttagtt 24540 
cgaacgaata gtgctagaac aaatgggaac aaggctagac attccattcc aagagtctga 24 600 
aaatatgaat tgtcaatgat aattaaattg agtactaaca aatgtgaaga acatccaact 24 660 
attcgaatca atagtcttat tgtgaccaaa aagaatggat ggcacaattg ttccgagggc .24720 
aacaccggca ggatttgctg aaattgatag gaattactta cgaacattaa aattccacta 24780 
accaacaaaa gaaagtacat ttgcaattgc tctctgatea ccagggaacc ageattcagc 24840 
aatttttgat ggaagaacta aaaagaatgc ttgtgttgaa gcajgcgataa acgatcctgc 24900 
atgaagaaga cattcacgaa caaaatgtga ttttatgaaa ggaatfcgagg caatcatccg 24960 
aatcgatgct ccaattacat ttaaagttgt accaagaaga cc 25002 



<210> 32 ' 
<211> 1237 
<212> PRT 

<213> Rattus norvegicus 
<400> 32 

Met Ala Arg Ala Lys Leu Pro Arg Ser Pro Ser Glu Gly Lys Ala Gly 

1 5 10 15 

Pro Gly Asp Thr Pro Ala Gly Ser Ala Ala Pro Glu Glu Pro His Gly 

20 25 30 

Leu Ser Pro Leu Leu Pro Thr Arg Gly Gly Gly Ser Val Gly Ser Asp 

35 40 45 

Val Gly Gin Arg Leu His Val Glu Asp Phe Ser Leu Asp Ser Ser Leu 

50 55 60 

Ser Gin Val Gin Val Glu Phe Tyr Val Asn Glu Asn Thr Phe Lys Glu 
65 70 75 80 
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Arg Leu Lys Leu Phe Phe lie Lys Asn Gin Arg Ser Ser Leu. Arg lie 

85 / 90 .", 95 

Arg Leu E>he Asn Phe Ser Leu Lys Leu Leu Thr Cys Leu Leu Tyr lie 

100 . 1Q5 , ~ 110 

Val Arg Val Leu Leu Asp Asn Pro Asp Gin Gly lie Gl</ Cys Trp Gly 

* 115 120 125 

Cys. Thr Lys Tyr Asn Tyr Thr Phe Asn Gly Ser Ser Ser Glu Phe His 

130 135 1*40 

Trp Ala Pro lie Leu Trp Val Glu Arg Lys Met Ala Leu Trp Val lie . 
145 150 155 160 

Gin Val lie Val Ala Thr lie Ser Phe Leu' Glu Thr Met Leu Leu lie 

165 . . 170 t , 175 

Tyr Leu Ser Tyr Lys Gly Asn lie Trp Glu Gin He Phe His Val Ser 

180 185 190 

Phe Val Leu Glu Met lie As ( n Thr Leu Pro Phe lie He Thr Val Phe 

195 200 .• ' 205 

fTrp Pro Pro Leu Arg Asn Leu Phe lie Pro Val' Phe Leu Asn Cys Trp 

1 210 215 ' 220 

Leu Ala Lys His Ala Leu Glu Asn Met lie. Asn Asp Phe His Arg Ala 
225 230 235 240 

• He Leu Arg Thr Gin Ser Ala Met Phe Asn Gin Val Leu lie Leu Phe 
245 - 250 255 

Cys Thr Leu Leu Cys Leu Val Phe Thr Gly Thr Cys Gly He Gin His 

260 265 i . 270 

Leu Glu Arg Ala Gly Gly Asn Leu Asn . Leu Leu Thr Ser Phe Tyr Phe 

275 ' 280 285 

Cys He Val Thr Phe Ser Thr Val Gly Phe Gly Asp Y al Tnr Pro L Y S 

290 295 300 

He Trp Pro Ser Gin Leu Leu Val Val He Leu He Cys Val Thr Leu 
305 ~ 310 315 320 

Val Val Leu Pro Leu Gin Phe Glu Glu Leu Val Tyr Leu Trp Met Glu 

325 330 335 

Arg Gin Lys Ser Gly Gly Asn Tyr Ser Arg His Arg Ala Arg Thr Glu 

340 345 350 

Lys His Val Val Leu Cys Val Ser Ser Leu Lys He Asp Leu Leu Met 

355 360 365 

Asp Phe Leu Asn Glu Phe Tyr Ala His Pro Arg Leu Gin Asp Tyr Tyr 

370 375 380 

Vai Val He Leu Cys Pro Ser Glu Met Asp Val Gin Val Arg Arg Val 
385 390 395 400 

Leu Gin lie Pro Leu Trp Ser Gin Arg Val lie Tyr Leu Gin Gly Ser 

405 410 r 415 

Ala Leu Lys Asp Gin Asp Leu Met Arg Ala Lys Met Asp Asn Gly Glu 

420 425 4 30 

Ala Cys "Phe He Leu Ser Ser Arg Asn Glu Val Asp Arg Thr Ala Ala 

435 440 445 

Asp His Gin Thr He Leu Arg Ala Trp Ala Val Lys Asp Phe Ala Pro 

4 50 " 4 55 4 60 

Asn Cys Pro Leu Tyr Val Gin He Leu Lys Pro Glu Asn Lys Phe His 
4 65 " 470 475 480 

Val Lys Phe Ala Asp His Val Val Cys Glu Glu Glu Cys Lys Tyr Ala 

485 490 495 

Met Leu Ala Leu Asn Cys He Cys Pro Ala Thr Ser Thr Leu lie Thr 

500 505 510 

Leu Leu Val His Thr Ser Arg Gly Gin Glu Gly Gin Glu Ser Pro Glu 

515 520 525 

Gin Trp Gin Arg Met Tyr Gly Arg Cys Ser Gly Asn Glu Val Tyr His 

530 535 540 

lie Arg Met Gly Asp Ser Lys Phe Phe Arg Glu Tyr Glu Gly Lys Ser 
545 550 555 560 
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Phe Thr Tyr Ala Ala Phe His Ala His Lys Lys Tyr Gly Val Cys Leu 

565 570 .". 575 

lie- Gly Leu Lys Arg Glu G lu Asn Lys Ser He Leu Leu Asn Pro Gl y 

580 ' ,58*5 v . ~ v 5915 

Pro Arg His He Leu Ala Ala Ser Asp Thr . Cys Phe Tyr lie Asn He 

1 595 . 600 ' 605 

Thr Lys Glu Glu Asn Ser Ala Phe lie Phe Lys Gin Glu Glu Lys Gin 

610 615 .620 

Asn Arg Arg Gly Leu Ala Gly Gin Ala Leu Tyr Glu Gly Pro Ser Arg 
625 630 635. 640 

Leu Pro Vai His Ser . He He Ala Ser Met" Val Ala Met Asp Leu Gin 

645. . 650 , , 655 ' 

Asn Thr Asp Cys Arg Pro Ser Gin Gly Gly Ser Gly. Gly Gly Gly Gly 

660 665 670 , 

Lys Leu Thr Leu Pro Thr G^u Asn Gly Ser Gly Ser Arg Arg. Pro Ser 

675 680 ' 685 

file Ala Pro Val Leu Glu Leu Ala Asp Ser Ser Ala Leu Leu Pro Cys 

' 690 . 695 ' 700 

Asp Leu Leu Ser Asp Gin Ser Glu Asp Glu Val Thr Pro Ser Asp Asp 
705 A 710 715 . 720 

Glu Gly Leu Ser Val Val Giu Tyr Val Lys Gly Tyr Pro Pro Asn Ser 

725 . 730 735 

Pro Tyr He Gly Ser Ser Pro Thr Leu Cys His Leu Leu Pro Val Lys 

740 745 . • 750 

Ala Pro Phe Cys Cys Leu Arg Leu Asp Lys Gly Cys Lys His Asn Ser 

755 • 760 765 

Tyr Glu Asp Ala Lys Ala Tyr Gly Phe Lys Asn Lys ^eu lie lie Val 

770 775 780 

Ser Ala Glu Thr Ala Gly Asn Gly Leu Tyr Asn Phe He Val Pro Leu 
785 790 795 800 

Arg Ala Tyr Tyr Arg Ser Arg Arg Glu Leu Asn Pro He Val Leu Leu 

805 810 815 

Leu Asp Asn Lys Pro Asp His His Phe Leu Glu Ala He Cys Cys Phe 

820 825 830 

Pro Met Val Tyr Tyr Met Glu Gly Ser Val Asp Asn Leu Asp Ser Leu 

835 840 845 

Leu Gin Cys Gly lie lie Tyr Ala Asp Asn Leu Val Val Val Asp Lys 

850 855 860 

Glu Ser Thr Met Ser Ala Glu Glu Asp Tyr Met Ala Asp Ala Lys Thr 
865 870 875 880 

He Val Asn Val Gin Thr Met Phe Arg Leu Phe Pro Ser Leu Ser lie 

885 890 895 

Thr Thr Glu Leu Thr His Pro Ser Asn Met Arg Phe Met Gin Phe Arg 

900 905 910 

Ala Lys" Asp Ser Tyr Ser Leu Ala Leu Ser Lys Leu Glu Lys Gin Glu 

915 920 925 

Arg Glu Asn Gly Ser Asn Leu Ala Phe Met Phe Arg Leu Pro Phe Ala 

930 935 940 

Ala Gly Arg Val Phe Ser He Ser Met Leu Asp Thr Leu Leu Tyr Gin 
'945 950 955 960 

Ser Phe Val Lys Asp Tyr Met He Thr lie Thr Arg Leu Leu Leu Gly 

965 970 975 

Leu Asp Thr Thr Pro Gly Ser Gly Tyr Leu Cys Ala Met Lys Val Thr 

980 985 990 

Glu Asp Asp Leu Trp He Arg Thr Tyr Gly Arg Leu Phe Gin Lys Leu 

995 1000 1005 

Cys Ser Ser Ser Ala Glu He Pro lie Gly He Tyr Arg Thr Glu Cys 

1010 1015 1020 

His Val Phe Ser Ser Glu Pro His Asp Leu Arg Ala Gin Ser Gin He 
1025 1030 1035 1040 
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Ser Val Asp Met Glu Asp Cys Glu Asp Thr Arg Glu Ala. Lys Gly Pro 

1045 m . 1050 ( .1055 / 

Trp Gly Thr Arg Ala ' Ala • Ser "Gly Gly Gly Ser Thr His Gly Arg His ' 

1060 1065 10*70 ' 

Gly Gly Ser Ala Asp Pro Val Glu His Pro Leu lieu Arg Arg Lys » S>er 

107^ 1080 ' • • ■ 10.85 

Leu Gin Trp Ala Arg Lys Leu Ser Arg Lys Ser Ser »Lys Gin Ala Gly 

1090 1095 1100 

Lys Ala Pro Met Thr Thr Asp Trp lie Thr Gin Gin Arg Leu Ser Leu 
1105 1110 1115 • 1120 

Tyr Arg Arg Ser Glu Arg. Gin Glu Leu Ser Glu Leu* Val Lys Asn Arg ■ 

" 1125 ' , 1130 . ' , 1135 

Met Lys His Leu Gly Leu Prb Thr Thr Gly Tyr Glu Asp Val Ala Asn 

114Q 1145 1150 

Leu Thr Ala Ser Asp Val Met Asn Arg Val Asn Leu Gly Tyr Leu Gin 

115*5 ' 1160 1165 

Asp Glu Met Asn Asp' His His Gin Asn Thr Leu Ser " Tyr Val Leu lie 

1170 1175 1180 

Asn Pro Pto Pro Asp Thr Arg Leu Glu .Pro Asn Asp lie Val Tyr Leu 
1185 1190 1195- 1200 

lie Arg Ser Asp Pro Leu Ala His Val Thr Ser Ser Ser Gin Ser Arg 

1205 12,10 , 1215 

Lys Ser Ser Cys Ser Asn Lys Leu Ser Ser Cys Asn Pro Glu Thr Arg 

1220 , • * . 1225 1230 

Asp Glu Thr Gin Leu ■ * " ( 

1235 i 
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<211> 638 
<212> PRT 
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<400> 33 



Met 


Gly 


Lys 


lie 


Glu 


Asn 


Asn 


Glu Arg 


Val 


He 


Leu Asn 


Val 


Gly Gly 


1 








5 










10 








15 




Thr 


Arg 


His 


Glu 


Thr 


Tyr Arg 


Ser 


Thr 


Leu 


Lys 


Thr Leu 


Pro 


Gly 


Thr 








20 










25 








30 






Arg 


Leu 


Ala 


Leu 


Leu 


Ala 


Ser 


Ser 


Glu 


Pro 


Gin 


Gly' Asp 


Cys 


Leu 


Thr 






35 










40 . 








45 








Ala 


Ala 


Gly 


Asp 


Lys 


Leu 


Gin 


Pro 


Leu 


Pro 


Pro 


Pro Leu 


Ser 


Pro 


Pro 




50 










55 










60 








Pro 


Arg 


Pro 


Pro 


Pro 


Leu 


Ser 


Pro 


Val 


Pro 


Ser 


Gly Cys 


Phe 


Glu Gly 


65 










70 










75 . 








80 


Gly 


Ala 


Gly 


Asn 


Cys 


Ser 


Ser 


His 


Gly 


Gly 


Asn 


Gly Ser 


Asp 


His 


Pro 










85 










90 








95 




Gly 


Gly 


Gly 


Arg 


Glu 


Phe 


Phe 


Phe 


Asp 


Arg 


His 


Pro Gly 


Val 


Phe 


Ala 








100 










105 








110 






Tyr 


Val 


Leu 


Asn 


Tyr 


Tyr 


Arg 


Thr Gly 


Lys 


Leu 


His Cys 


Pro 


Ala 


Asp 




115 










120 








125 








Val 


Cys 


Gly 


Pro 
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Gly 


He 


Asp 




130 










135 










140 








Glu 


Thr 


Asp 


Val 


Glu 


Pro 


Cys 


Cys 


Trp 
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475 










480 


Ser 


Leu 


Ala 


Met 


Ala 
4 85 


Lys 


Gin 


Lys 


Leu 


Pro 
490 


Arg 


Lys 


Arg 


Lys 
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Glu 
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Cys 
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Ser 
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<400> 34 

Met Asp Cys Ser Leu Leu £rg Thr Leu Val Arg Arg Tyr Cys Ala Gly 

1 — --5 M It! > • ' ' 15- ; 

Glu Glu Asn Trp Val Asp Ser Arg Thr lie Tyr Val Gly His Lys Glu 

20. ' 1 25 '■ . 30 . • ' 

Pro Pro Pro, Gly Ala Glu Ala Tyr lie 'Pro Gin Arg Tyr Pro Asp Asn 

35 40 .4.5 

Arg lie Val Ser Ser Lys Tyr Thr Phe Trp" Asn Phe lie Pro Lys Asn 

50 55 60 

Leu Phe Glu Gin Phe Arg Arg lie Ala Asn Phe Tyr Phe Leu lie lie 
65 70 .75 80 • 

Phe Leu Val Gin Leu lie" lie Asp Thr Pro Thr' Ser Pro Val Thr Ser 

85 \ 90 95 

Gly Leu Pro Leu ,Phe Phe Val lie Thr Val Thr Ala lie Lys Gin Gly 

100 105 110 

Tyr Glu Asp' Trp Leu Arg His ,Lys Ala Asp Asn Ala Met Asn Gin Cys 

115 . 1 120 125 

Pro Val His Phe He Gin His Gly Lys Leu Val Arg Lys Gin Ser Arg 

130 1 135 . 140 

Lys Leu Arg Val Gly Asp He Val Met Val Lys Glu Asp Glu Thr Phe 
145 ' 150 155 . i - 160 

Pro Cys Asp Leu lie Phe Leu Ser Ser .to Arg Ala Asp Gly Thr Cys 

.165. 170 175 

His Val Thr Thr Ala Ser I^eu 1 Asp Gly Glu Ser Ser His Lys Thr His 
180 185 '190 , 

Tyr Ala Val Gin Asp, Thr Lys Gly Phe His Thr Glu Ala Asp Val Asp 

195 200 ( ■ ■ 205 ' 

Ser Leu His Ala Thr He Glu Cys Glu Gin Pro Gin Pro Asp Leu Tyr 

210 215 220 

Lys Phe Val Gly Arg lie Asn Val Tyr Asn Asp Leu Asn Asp Pro Val 
225 230 235 240 

Val Arg Pro Leu Gly Ser Glu Asn Leu Leu Leu Arg Gly Ala Thr Leu 

245 250 1 255 

Lys Asn Thr Glu Lys He Phe Gly Val' Ala He Tyr Thr Gly Met Glu 
260 265 270 

' Thr Lys Met Ala Leu Asn Tyr Gin Ser Lys Ser Gin Lys Arg Ser Ala 
275 ' 280 285 

Val Glu Lys Ser Met Asn Thr Phe Leu lie Val Tyr Leu Cys He Leu 

290 295 300' . 

Val Ser Lys Ala Leu He Asn Thr. Val Leu Lys Tyr Val Trp Gin Ser 
305 310 ' 315 320 

Glu Pro Phe Arg Asp Glu Pro Trp Tyr Asn Glu Lys Thr Glu Ser Glu 

325 330 335 

Arg Gin Arg Asn Leu Phe Leu Arg Ala Phe Thr Asp Phe Leu Ala Phe 

340 34,5 • . . 350 

Met Val Leu Phe Asn Tyr lie lie Pro Val Ser Met Tyr Val Thr Val 

355 360 365 

Glu Met Gin Lys Phe Leu Giy Ser Tyr Phe lie Thr Trp Asp Glu Asp 

370 375 380 

Met Phe Asp Glu Glu Met Gly Glu Gly Pro Leu Val Asn Thr Ser Asp 
385 ~ 390 395 400 

Leu Asn Glu Glu Leu Gly Gin Val Glu Tyr He Phe Thr Asp Lys Thr 

405 410 415 

Gly Thr Leu Thr Glu Asn Asn Met Ala Phe Lys Glu Cys Cys lie Glu 

420 425 430 

Gly His Val Tyr Val Pro His Val lie Cys Asn Gly Gin Val Leu Pro 

435 440 445 

Asp Ser Ser Gly He Asp Met lie Asp Ser Ser Pro Gly Val Cys Gly 

450 455 460 

Arg Glu Arg Glu Glu Leu Phe Phe Arg Ala He Cys Leu Cys His Thr 
465 470 475 480 
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Asp 


Asp Asp 
485 M ' 
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Gly 
Val 


Asp 
490 
Tyr 


Asp Val Asp 
lie Ser Ser 


Gly 
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Pro Gin 
495 

Pro Asp 






500 
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510 






Glu Val 
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Glu 


Gly Val Gin Arg Leu Gly Phe 


Thr Tyr 'Leu 
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Met 


Glu 


lie 


Leu , 
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Arg. 


Glu 


Asn Asp 


lie 


530 
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Glu Arg 
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Glu 
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Leu 


Glu 


Val 
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Thr 


Phe 


Asp 


Ser 


Val 


Arg 


Arg 


545 








550 










555 




• 






560 


Arg Met 


Ser 


Val 


He 


Val, Lys 


Ser Thr 


Thr Gly Glu 


He 


Tyr Leu 


Phe 






565 




i 






570 










5,75 




Cys Lys Gly Ala 


Asp 


Ser 


Ser 


lie 


Phe 


Pro Arg Val 


He 


Glu 


Gly Lys 






580 










585 










590 






Val Asp Gin, Val Arg 


per 


Arg 


Val 


Glu Arg Asn Ala 


Val 


Glu Gly 


Leu 
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600 
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Arg Thr 


Leu 


Cys 


Va'l 


Ala 


Tyr 


Lys 


Arg 


Leu 


Glu 
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Glu 


Gin 


Tyr 


Glu 


610 


i 








615 










620 










Asp Ala 


Cys 


Arg" 


Leu 


Leu 


Gin 


Ser Ala Lys 


Val 


Ala 


Leu 


Gin 


Asp Arg 


625 








630 










635 










640 


Glu Lys Lys' Leu Ala Glu Ala Tyr Glu Qln 


He 


Glu 


Lys 


Asp 


Leu 


Val 








645 










650 










655 




Leu Leu 


Gly Ala 


Thr 


Ala 


Val 


Glu Asp Arg 


Leu 


Gin 


Glu 


Lys 


Ala 


Ala 






660 






i 

i 




665 










670. 






Asp Thr 


He 


Glu 


Ala 


Leu 


Gin 


Lys 


Ala 


Gly 


He 


Lys 


Val 


fcrp 


Val 


Leu* 


675 










680 










685 








Thr Gly 


Asp 


Lys 


Met 


Glu 


Thr 


Ala 


Ser 


Ala 


Thr 


Cys 


Tyr 


Ala 


Cys 


Lys 


690 










695 










700 










Leu Phe Arg 


Arg 


Ser 


Thr 


Gin 


Leu 


Leu 


Glu 


Leu 


Thr 


Thr 


Lys 


Lys 


Leu 


705 








710 










715 










720 


Glu Glu 


Gin 


Ser 


Leu 


His 


Asp 


Val 


Leu 


Phe 


Asp 


Leu 


Ser 


Lys 


Thr 


Val 








725 










730 










735 




Leu Arg 


Cys 


Ser 


Gly 


Ser 


Met 


Thr 


Arg 


Asp 


Ser 


Phe Ser 


Gly 


Leu 


Ser 






740 










745 










750 






Thr Asp 


Met 


His 


Asp Tyr 


Gly 


Leu 


He 


He 


Asp 


Gly Ala 


Ala 


Leu 


Ser 




755 
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Leu lie 


Met 


Lys 
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Arg 


Glu 
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Gly 


Ser 


Ser 


Ser 


Gly 


Asn 
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Arg 
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78 0 
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He 
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Leu 


Ala 


He Gly Asp 


Gly 


Ala 
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Asp 
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830 






Val Ser 
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Glu 
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His 


Val 


Gly 
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Gly 


Val 
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Gly 


Lys 
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Glu Gly Arg Gin Ala 


Ala 
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Ser 
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He 


Pro 
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Phe 


850 
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Phe 
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Phe 
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Gin 


Phe 


Phe 
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Gly 


Phe 


Ser 


Gin 


Gin 
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Thr Leu 
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Asp 
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Ala 
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Thr 


Leu 
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He 


Ser 


Phe 
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He 
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Leu 


Tyr 


Ser 
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Met 


Glu 


Gin 


His 


Val 


Gly 


lie 


930 
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Asp Val 
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Lys 
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Pro 
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Leu 


Tyr Arg Asp 


He 


Ala 
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Asn 


945 








950 










955 










960 
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Ala Leu Leu Arg Trp Arg Val Phe lie Tyr Trp Thr Phe Leu . Gly Val 

965 970 .", v -975 

Phe Asp Ala Leu Val Phe Phe Phe Gly Ala Tyr Phe li e Phe Glu. Asn 

980 : FS3 7" " " ' S91T . 

Thr Thr Val Thr lie Asn Gly Gin Met Phe Gly Asn Trp Thr Phe Gly 

'995 1000 1005 

Thr Leu Val Phe Thr Val Met Val Leu Thr Val Thr Leu Lys Leu Ala 

1010 1015 lb20 

Leu Asp Thr His Tyr Trp Thr Trp lie Asn His Phe Val lie Trp Gly 
1025 1030 1035 1040 

Ser Leu Leu Phe Tyr lie. Ala Phe Ser Leu' Leu Trp Gly Gly Val lie 

1045 . ' 1050 ] , 1055 

Trp Pro Phe Leu Ser Tyr Gin Arg Met Tyr Tyr Val Phe lie Ser Met 

1060 1065 1070 

Leu Ser Ser Gly Pro Ala Trp Leu Gly lie lie Leu Leu Val Thr Val 

1075 1080 / 1085 

Gly Leu Leu Pro Asp Val Leu Lys Lys Val Leu'Cys Arg Gin Leu Trp 

' 1090 1095' 1100 

Pro Thr Ala Thr* Glu Arg Thr Gin Asn lie Gin His Gin Asp Ser lie 
1105 U-10. 1115 1120 

i Ser Glu Phe Thr Pro Leu Ala Ser Leu Pro Ser Trp Gly Ala Gin Gly 
1125 1130 1135 

Ser. Arg Leu Leu Ala Ala Gin Cys Ser Ser Pro Ser Gly Arg Val Val 

1140 1145 , .1150 

Cys Ser Arg Trp Glu Ser Glu Glu Cys Pro Val Leu Pro Leu His Pro 

1155 1160 1165 

Gly Leu Pro His Lys Ala Arg Tyr Gly Cys Cys Arg Sjer Ser Leu Glu 

1170 1175 1180 

Met Pro Thr 
1185 



<210> 35 
<211> 1508 
<212> PRT 

<213> Mus musculus 
<400> 35 

Met Glu Arg Glu Leu Pro Ala Ala Glu Glu Ser Ala Ser Ser Gly Trp 

1 5 10 15 

Arg Arg Pro Arg Arg Arg Arg Trp Glu Gly Arg Thr Arg Thr Val Arg 

20 25 30 

Ser Asn Leu Leu Pro Pro Leu Gly Thr Glu Asp Ser Thr lie Gly. Ala 

35 40 45 

Pro Lys Gly Glu Arg Leu Leu Met Arg Gly Cys lie Gin His Leu Ala 

50 55 60 

Asp Asn Arg Leu Lys Thr Thr Lys Tyr Thr Leu Leu Ser Phe Leu Pro 
65 70 15 .80 

Lys Asn Leu Phe Glu Gin Phe His Arg Leu Ala Asn Val Tyr Phe Val 

85 90 95 

Phe lie Ala Leu Leu Asn Phe Val Pro Ala Val Asn Ala Phe Gin Pro 

100 105 110 

Gly Leu Ala Leu Ala Pro Val Leu Phe lie Leu Ala Val Thr Ala lie 

115 120 125 

Lys Asp Leu Trp Glu Asp Tyr Ser Arg His Arg Ser Asp His Glu lie 

130 135 140 

Asn His Leu Gly Cys Leu Val Phe Ser Arg Glu Glu Lys Lys Tyr Val 
145 150 155 160 

Asn Arg Tyr Trp Lys Glu lie Arg Val Gly Asp Phe Val Arg Leu Cys 
165 170 175 
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Cys Asn Glu lie lie Pro Ala Asp lie Leu Leu Leu Ser .Ser. Ser Asp 



180 185 .', 190 
Pr o Asp Gly Leu Gys His lie Glu Thr Ala Asn Leu Asp Gly Glu Thr 
195 "■' ' ~ZUG ? 7~' : '2*05 
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Ser 
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Val 
330 


Gly 
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335 


Val 
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Ser 
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Glu 
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Glu 
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He 


Gly 


Asn 


Leu 


Ser 


Thr 


Ser 


Lys 


Ser 


Ser His 


Lys 


625 










630 










635 








640 


Ser 


Gly 


Ser 


Ala 


Phe 
645 


Leu 


Pro 


Ser 


Leu 


Ser 
650 


Gin 


Asp 


Ser 


Met Leu 
655 


Leu 
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Giy Leu Glp Glu Lys* Leu Gly Glh Thr Ala Pro Ser He Ala Ser Asn 

660 rt . " 665 ( V... 670 

. Gly Tyr Ala Ser Gin' kla Gly"Gln Glu £lu Ser. Trp Ala Ser Asp Cys 

"F75 " ~6UI5 : 685 

thr Thr Asp Gin l^ys Cys Pro Gly Glu . Gin Arg Glu Gin Gin Glu > (Sly 

690 , 695 ' 1 700 . 

Glu Leu Arg Tyr Glu Ala Glu Ser Pro Asp Glu. Ala-Ala Leu Val Tyr' 
705 710 " 715 720 

Ala Ala Arg Ala Tyr Asn Cys Ala Leu Val Asp Arg Leu His Asp Gin 

725 730 735 

Val Ser Val Glu Leu Pro His Leu- Gly Arg Leu Thr' Phe Glu Leu Leu • 

740 . 745 .. \ 750 , 

His Thr Leu Gly Phe Asp Ser lie Arg Lys Arg Met Ser Val Val lie 

755 , 760 765 

Arg His Pro Leu Thr Asp Glu lie Asn Val Tyr Thr Ly$ Gly Ala Asp 

770 V * 775 , 780 

Ser Val Val Met Asp Leu Leu Leu Pro Cys Ser Ser' Asp Asp Ala Arg 
785 790 795 ' 800 

Gly Arg rfis Gin" Lys Lys lie Arg Ser. Lys Thr Gin Asn Tyr Leu Asn 

805 . 810 . . 815 

Leu Tyr Ala Val Glu Gly Leu Arg Thr Leu Cys lie Ala Lys Arg Val 

820 825. , 830 

Leu Ser Lys Glu Glu Tyr Ala Cys Trp Leu Gin Ser His lie Glu Ala 

835 , 1 B40 845 

Glu Ala Ser Val Glu Ser Arg Glu Glu Leu Leu Phe Gin 'Ser Ala VajL 

850 ' . 855 860 

Arg Leu Glu Thr Asn Leu His Leu Leu Gly Ala Thr Gly lie Glu Asp 
865 870 875 880 

Arg Leu. Gin Glu Gly Val Pro Glu Thr lie Ala Lys Leu Arg Gin Ala 

885 890 895 

Gly Leu Gin lie Trp Val Leu Thr Gly Asp Lys Gin Glu Thr Ala lie 

900 905 < 910 

Asn lie Ala Tyr Ala Cys Lys Leu Leu Asp His Gly Glu Glu Val lie 

915 920 925 

Thr Leu Asn Ala Asp Ser Gin Glu Ala Cys Ala Ala Leu Leu Asp Gin 

930 335 940 

Cys Leu Ser Tyr Val Gin Ser Arg Asn Pro Arg Ser Thr Leu Gin Asn 
94 5 950 • 955 960 

Ser Glu Ser Asn Leu Ser Val Gly Phe Ser Phe Asri Pro Val Ser Thr 

965 970 975 

Ser Thr Asp Ala Ser Pro Ser Pro Ser Leu Val lie Asp Gly Arg Ser 

980 985 990 

Leu Ala Tyr Ala Leu Glu Lys Ser Leu Glu Asp Lys Phe Leu Phe Leu 

995 1000 1005 

Ala Lys Gin Cys Arg Ser Val Leu Cys Cys Arg Ser Thr Pro Leu Gin 

1010 1015 t - 1020 

Lys Ser Met Val Val Lys Leu Val Arg Ser Lys Leu Lys Ala Met Thr 
1025 1030 1035 1040 

Leu Ala lie Gly Asp Gly Ala Asn Asp Val Ser Met lie Gin Val Ala. 

1045 1050 1055 

Asp Val Gly Val Gly lie Ser Gly Gin Glu Gly Met Gin Ala Val Met 

1060 1065 1070 

Ala Ser Asp Phe Ala Val Pro Arg Phe Arg Tyr Leu Glu Arg Leu Leu 

1075 1080 1085 

lie Val His Gly His Trp Cys Tyr Ser Arg Leu Ala Asn Met Val Leu 

1090 1095 1100 

Tyr Phe Phe Tyr Lys Asn Thr Met Ser Val Gly Leu Leu Phe Trp Phe 
1105 1110 1115 1120 

Gin Phe Tyr Cys Gly Phe Ser Ala Ser Ala Met lie Asp Gin Trp Tyr 
1125 1130 1135 
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Leu lie Phe Phe Asn Leu Leu Phe Ser Ser Leu Pro Gin Leu Val Thr 

1140 ■ h 1145 , u ?0 

Gly Val Leu Asp Lys Asp Val Pro Ala Asp Met Leu Leu Arg Glu Pro ' 

~ 1X55 ; ; llbu " 1 165 

Gin Leu Tyr Lys Ser Gly Gin Asn Met. Glu Glu Tyr Arg Pro Arg 'Ala 

H70 , 1175 * ' ' use • 

Phe Trp Leu Asn Met Val Asp Ala Ala Phe Gin Ser -Leu Val Cys Phe 
1185 1190 ' H95 1200 

Phe He Pro Tyr Leu Ala Tyr Tyr Asp Ser Asp Val Asp Val Phe Thr 

1205 1210 1215 

Trp Gly Thr Pro Val Thr Ala He Ala Leu Phe Thr* Phe Leu Leu His • 

1220 1225 , ' , 1230 * 

Leu Gly lie Glu Thr Lys Thr Trp Thr Trp Leu Asn Trp Leu Ala Cys I 

1235 1240 1245 

Gly Phe Ser Thr Phe Leu Phe Phe Ser Val Ala Leu He Tyr Asn Thr 

1250 ( 1255, 1260 

Ser Cys Ala Thr Cys Tyr Pro Pro Ser Asn Pro Tyr Trp Thr Met Glh 
1265 . § ■ 1270 1275 ' 1280 

Thr Leu Leu Gly Asp Pro Leu Phe Tyr Leu Thr Cys Leu He Ala Pro 

1285 1290 1295 ' 

lie Ala Ala Leu Leu Pro Arg Leu Phe Phe Lys Ala Leu Gin Gly Ser 

1300 1305 ' . 1310 

Leu Phe Pro Thr .Gin Leu Gin Leu Gly Arg Gin Leu Ala Lys Lys Pro 

1315 . 1 1320 1325 

Leu Asn Lys Phe Ser Asp Pro Lys Glu Thr Phe ' Ala' Gin Vsiy Gin Pro 

1330 » , 1335 1340 

Pro Gly His Ser Glu Thr Glu Leu Se,r Glu Arg Lys Thr Met Gly Pro 
1345 1350 1355 1360 

Phe Glu Thr Leu Pro Arg Asp Cys Ala Ser Gin Ala Ser Gin Phe Thr 

1365 ,. 1370 1375. 

Gin Gin Leu Thr Cys Ser Pro Glu Ala Ser Gly Glu Pro Ser Ala Val 

138.0 1385 ■ 1390 

Asp Thr Asn Met Pro Leu Arg Glu Asn Thr Leu Leu' Glu Gly Leu Gly 

1395 1400 1405 

Ser Gin Ala Ser Gly Ser Ser Met Pro Arg Gly Ala lie Ser Glu Val 

1410 1415 1420 

Cys Pro Gly Asp Ser Lys Arg Gin Ser Ser Ser Ala Ser Gin Thr Ala 
I 425 1430 • 1435 1440 

Arg Leu Ser Ser Leu Phe His Leu Pro Ser Phe Gly' Ser Leu Asn Trp 

1445 1450 1455 

lie Ser Ser Leu Ser Leu Ala Ser Gly Leu Gly Ser Val Leu Gin Leu 

1460 1465 1470 

Ser Gly Ser Ser Leu Gin Met Asp Lys Gin Asp Gly Glu Phe Leu Ser 

1 47 5 1480 1485 

Asn Pro Pro Gin Pro Glu Gin Asp Leu His Ser Phe Gin Gly Gin Val 

14 90 . 14 95 1500 

Thr Gly Tyr Leu 
1505 



<210> 36 
<211> 1095 1 
<212> PRT 

<213> Mus musculus 
<220> 

<221> VARIANT 

<222> 801, 1005 

<223> Xaa = any amino acid 
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<400> 36 
Met Pro Leu Met 
.. 1 ■_ ' 

Tyr His Thr Leu 
■20 

Glu Trp Phe Val 
35 

Asp Trp Leu lie 
50 

Thr Val Trp Leu 
65 

Asn Ser lie Lys 
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Met 
5 

"Fro 



Ser Glu Glu Gly 
Arc) Ala 



Val Leu Tyr Glu 
100 

Val Ser Cys Ser 
115 

Thr Tyr Trp Ala 
130 

Glu Ala lie Asp 
145 

Ser Gin Leu Tyr 

Ser Ser Asp He 
180 

Arg He Pro Ser 
195 

Ser Cys Phe He 
210 

Leu Lys Val Ala 
225 

Leu Phe Ser He 

He His Ser Phe 
260 

He His Glu Ser 
275 

Val Ala Ser Gly 

. 290 
Thr Arg Ser Val 
305 

Leu Asp Leu Glu 

Val Val Leu Ser 
340 

Trp Tyr Arg Asn 
355 

Pro He Ser Leu 
370 

Trp Met He Met 
385 

Ser Thr lie Pro 

Lys Thr Gly Thr 
4 20 

Leu Gly Thr Val 

435 . 
His Val Leu Asn 
450 

Asn Pro Ser Ser 
4 65 



Cys 

Asn 

Gly 

Asn 
85 
Gin 

Gin 

Pro 

Glu 

Ser 
165 
Gin 

Asp 

Arg 

Val 



Arg lie 
'25 

Gly Gly Trp Lys 
40 

Gin Arg 




Ser 
245 
Glu 



Val Cys 

55 
Cys Pro 
70 

Gln f Lys 

Phe Lys 

Phe VaU 

Leu Gly 
135 
Phe Arg 
150 

Lys Leu 

Val Gly 

Met Val 

Thr Asp 
215 
Ser Cys 
230 

Ala Tyr 



Leu 

Thr 

Met 

Leu 
325 
Val 

Leu 

Arg 

Lys 

Glu 
405 
Leu 

Ser 

Ser 

Ala 



Gly Thr 
Ser He 



Val He 
295 
Asn Thr 
310 

Asn Gin 

Val Met 

Phe Arg 

Val Asn 
375 
Asp Glu 
390 

Glu Leu 

Thr Gin 

Tyr Gly 

Tyr Leu 
455 
Pro Leu 
470 



Glu Lys 

Tyr Asn 

Phe Phe 
105 
Pro Ala 
120 

Phe Val 

Arg Phe 

Thr Val 

Asp Leu 
185 
Phe Leu 
200 

Gin Leu 

Thr Gin 

Val Tyr 

Phe Thr 
265 
Glu Asn 
280 

Gly Val 



Phe Leu Cys 

Lys Lys Glu 
60 

Cys Glu Glu 

. ,75 

Val Phe Thr 
90 1 

Leu Asn Leu 

Leu Lys .He 

Leu Ala Val 
140 

Gin Arg Asp 

155 
Arg Gly Lys 
170 

He lie Val 

Arg Thr Ser 

Asp Gly Glu 
220 

Arg Leu Pro 

235 
Ala Gin Lys 
250 

Arg Glu Asp 
Thr Leu Trp 



Ser Asn 

Leu Thr 

Val Thr 
345 
Phe Leu 
360 

Leu Asp 

Asn He 

Gly Arg 

Asn Glu 
425 
Thr Asp 
440 

Gin Val 
Arg Arg 



Val He Tyr 
300 

Pro Asn Asn 
315 

Lys Ala Leu 
330 

Leu Gin Gly 

Leu Leu Phe 

Met Gly Lys 
380 

Pro Gly Thr 

395 
Leu Val Tyr 
410 

Met Val Phe 

Thr Met Asp 

His Ser Gin 
460 

Ser Gin Ser 
475 
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30 

Thr Ser. 
45 . 

Leu Lys 

Lys His 

Phe lie 
i 

Tyr Phe 
110 
Gly Tyr 
125 

Thr lie 

Lys Glu 

Val Gin 

Glu Lys 
• 190 
Glu Lys 
205 

thr Asp 

Ala Leu 

Pro Gin 

Ser Asp 
270 
Ala Ser 
285 

Thr Gly 



Lys Val 
Phe Leu 



Phe Ala 
350 
Ser Tyr 
3 65 

Ala Ala 

Val Val 

Leu Leu 

Lys Arg 
430 
Glu lie 
445 

Pro Ser 
Ser Thr 



Cys Cys 

Ala Arg 

Pro Arg 
80 

Pro Gly, 
95 

Leu Val 

Leu Tyr 

Ala Arg 

Met Asn 
1 60 
Val Lys 
175 

Asn Gin 

Ala Gly 

Trp Lys 

Gly Asp 
240 
Leu Asp 
255 

Pro Pro 

Thr lie 

Lys Glu 

Gly Leu 
320 
Ala Leu 
335 

Gly Pro 



He He 

Tyr Gly 

Arg Thr 
400 
Thr Asp 
415 

Leu His 

Gin Ser 

Gly His 

Pro Lys 
4 80 
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Val Lys Lys Ser Val Ser Ser Arg lie His Glu Ala Val Lys Ala lie 
---4 8 5 * (i 490 , ., -.4.95.^... 

Ala Leu. Cys. His. Asn' Val Thr Pro Val tfyr Glu Ala Arg Ala Gly tie 
— : 500 5^5 — : — — 5r±fl- 



Tnr Gly 


Glu Thr Glu 


Jrne 


Ala 


Glu 




Asp 


Gin Asp Phe 


oer 


ASp 


GlU 




515 










520 




i 






525 








Asn Arg 


Thr 


Tyr 


Gin 


Ala 


Ser 


Ser . 


Pro 


Asp 


Glu 


Val- 


Ala 


Leu 


Val 


Arg 












535 








540 










Trp Thr 


Glu 


Ser 


Val 


0>ly 


Leu 


Thr 


Leu 


vai 


Ser 


Arg 


Asp 


Leu 


Ala 


Ser 


C A C 

545 








CCfi 










555 










560 


Mot- filn 

Met bin 


Leu Lys 


Thr 


Pro 


Ser Gly 


tain 


vai 


Leu 


Thr 


Tyr 


Cys 


lie 


Leu 








565 


V 






i 


3 /u 


• 






ETC 

p / o 




CjtJLn Met 


Phe 


Pro 


Phe 


i nr 


Ser Glu 


Ser 


Lys 


Arg 


Met 


Gly 


He 


lie 


vai 






580, 










C D C 

585 










590 






Arg Asp 


Glu 


Ser 


Thr 


Ala 


Glu 


lie 


Thr 


Phe 


Tyr 


Met 


Lys 


Gjiy 


Ala 


Asp 




594 










,600 










605 




Val Ala 


Met 


Ser 


■ . t 
Thr 


He 


Val 


Gin 


Tyr 


Asn 


Asp 


Trp Leu 


Glu 


Glu 


Glu 


610 










615 










620 










Cys Gly 


Asn Met 


Ala 


Arg 


Glu 


Gly 


Leu- 


Arg 


Thr 


Leu 


Val 


Val 


Ala 


Lys 


625 








630 










635. 










64 0 


Arg Thr 


Led 


Thr 


Glu 


Glu 


Gin 


Tyr 


Gin 


Asp 


the 


Glu 


Ser Arg 


Tyr 


Ser 








645 










650 










655 




Gin Ala 


Lys 


Leu 
660 


Ser 


He 


His 


Asp 


Arg 
665 


Ala 


Leu 


Lys 


Val 


Ala 
670 


Ala 


Val 


Val Glu 


Ser 
675 


Leu 
t 


Glu 


Arg 


Glu 


Met 
680 


Glu 


Leu 


Leu 


Cys 


Leu 
685 


Thr 


Gly 


Val 


Glu Asp 


Gin 


Leu 


Gin 


Ala 


Asp 


Val 


Arg 


Pro 


Thr 


Leu 


Glu 'Met 


Leu 


Arg 


690 










695 








700 










Asn Ala 


Gly 


He 


Lys 


He 


Trp 


Met 


Leu 


Thr 


Gly 


Asp 


Lys 


Leu 


Glu 


Thr 


705 








710 










715 










720 


Ala Thr 


Cys 


He 


Ala 
725 


Lys 


Ser 


Ser 


His 


Leu 
730 


Val 


Ser 


Arg 


Thr 


Gin 
735 


Asp 


He His 


Val 


Phe 
74 0 


Arg 


Pro 


Val 


Thr 


Ser 
745 


Arg 


Gly 


Glu' 


Ala 


His 
750 


Leu 


Glu 


Leu Asn 


Ala 
755 


Phe 


Arg 


Arg 


Lys 


His 
7 60 


Asp 


Cys 


Ala 


Leu 


Val 
765 


lie 


Ser 


Gly 


Asp Ser 


Leu 


Glu 


Val 


Cys 


Leu 


Arg 


Tyr 


Tyr 


Glu 


His 


Glu 


Leu 


Val 


Glu 


770 










775 










780 










Leu Ala 


Cys 


Gin 


Cys 


Pro 


Ala 


Val 


Val 


Cys 


Cys Arg Gys 


Ser 


Pro 


Thr 


785 








790 




• 






795 










800 


Xaa Lys 


Ala 


His 


He 
805 


Val 


Thr 


Leu 


Leu 


Arg 
810 


Gin 


His 


Thr 


Arg 


Lys 
815 


Arg 


Thr. Cys 


Ala 


He 


Gly Asp 


Gly 


Gly 


Asn 


Asp 


Val 


Ser 


Met 


He 


Gin 


Ala 






820 










825 










830 






Ala Asp 


Cys 


Gly 


He 


Gly 


He 


Glu 


Gly 


Lys 


Glu 


Gly Lys 


Gin 


Ala 


Ser 




835 










840 










8 45 








Leu Ala 


Ala Asp 


Phe 


.Ser 


He 


Thr 


Gin 


Phe 


Arg 


His 


He Gly 


Arg 


Leu 


O C C\ 










855 










860 










Leu Met 


Val 


His 


Gly Arg 


Asn 


Ser 


Tyr 


Lys 


Arg 


Ser 


Ala 


Ala 


Leu 


Gly 


ore 
ODD 








870 










875 










880 


oin irne 


Val 


Met 


His 
885 


Arg 


Gly 


Leu 


He 


He 
890 


Ser 


Thr 


Met 


Gin 


Ala 
895 


Val 


Phe Ser 


Ser 


Val 
900 


Phe 


Tyr 


Phe 


Ala 


Ser 
905 


Val 


Pro 


Leu 


Tyr 


Gin 
910 


Gly 


Phe 


Leu Met 


Val 


Gly Tyr Ala 


Thr 


He 


Tyr 


Thr 


Met 


Phe 


Pro 


Val 


Phe 


Ser 




915 










920 










925 








Leu Val 


Leu Asp Gin Asp 


Val 


Lys 


Pro 


Glu 


Met 


Ala 


He 


Leu 


Tyr 


Pro 


930 










935 










940 










Glu Leu 


Tyr Lys 


Asp 


Leu 


Thr 


Lys 


Gly Arg 


Ser 


Leu 


Ser 


Phe 


Lys 


Thr 


945 








950 










955 










960 
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Phe Leu lie Trp Val Leu lie Ser lie Tyr Gin Gly Gly lie Leu Met 

965 . : 970 • 975 

Tyr Gly Ala Leu Leu Leu Phe Glu Asp Glu Phe Val His Val Val Ala 

; 980 - r 985 ■ : ~ : 99D~ ■' 

lie Ser^ Phe Thr Ala Leu lie Leu Thr Glu Leu Leu Xaa Val Ala Leu 

995 1000 , 1005 

Thr He Arg Thr Trp His Trp Leu Met Val Val Ala Glu Phe Leu Ser 

1010 1015 1020 

Leu Gly Cys Tyr Val Ala Ser Leu Ala Phe Leu Asn Glu Tyr Phe Gly 
1025 1030 , 1035 1040 

He Gly Arg Val Ser Phe Gly Ala Phe Leu Asp Val' Ala Phe He Thr 

1045 • ' 1050 , ■ 1055 

Thr Val Thr Phe Leu Trp Lys Val Ser Ala lie Thr Val Val Ser Cys 

1060 1065 1070 

Leu Pro Leu Tyr Val Leu Lys Tyr Leu Lys Arg Lys Leu Ser Pro Pro 

1075 ** 1080 - ' 1085 

'Ser Tyr Ser Lys Leu Ser Ser 
1090 1095 



■ <210> 37 
<211> 9 
<212> PRT 

<213> Artificial Sequence . 1 

<220> 

<223> domain • 
<220> 

<221> VARIANT 
<222> (1) 

<223> Xaa * Asp, Asn, or Ser 
m <220> 

" <221> VARIANT 
<222> (2) 

<223> Xaa = Gin, Glu, Asn, or Arg 
<220> 

<221> VARIANT 
<222> (3) 

.<223> Xaa = Ser or Ala 
<220> 

<221> VARIANT . 
<222> (4) 

<223> Xaa = Leu, He, Val. Ser, Ala, or Asn 
<220> 

<221> VARIANT 
<222> (5) 

<223> Xaa = Leu, He,, or Val 
<220> 

<221> VARIANT 
<222> (6) 

<223> Xaa » Thr, Ser, or Asn 
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<226>; : : ■ 
<2 2 1>... VARIANT 
.<22.2> :.(9) 

<223> Xaa = Sex , ux Asa — 

<400> 37 * 
Xaa Xaa Xaa Xaa Xaa Xaa Gly Glu Xaa * - 

1 5 • 



<210> 38 ' 

<211> 10 v 

<212> PRT 

<213> Artificial Sequence 

<220> 1 
,<223> domain 

<220> 

<221> VARIANT 
, <222> (1) 
<223> Xaa = Leu, lie, or Val 

<220> 

<221> VARIANT 
<222> (2) 

<223> Xaa - Cys, Ala, Met, or Leu 
<220> 

<221> VARIANT 
<222> (3) 

<223> Xaa « Ser, Thr, Phe, or Leu 
<220> 

<221> VARIANT 
<222> (9) 

<223> Xaa = Leu or lie 
<400> 38 

Xaa Xaa Xaa Asp Lys Thr Gly Thr Xaa Thr 
1 5 10 



<210> 39 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> domain 
<220> 

<221> VARIANT 
<222> (1) 

<223> Xaa = Thr, lie, or Val 
<220> 

<221> VARIANT 
<222> (5) 

<223> Xaa = any amino acid 
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<220> t 

<221> VARIANT „ i 

<222> (8) m . . " ' 

•< 223> Xaa = Ala, Ser, or Gly 

' ; i s , r 

<220> 

<221> VARIANT 
<222> (10) 

<223> Xaa ■=* Ala, Ser, or Val 

<400> 39 1 
Xaa Gly Asp Gly Xaa Asrv Asp Xaa Pro Xaa Leu- 
1 5 ' • ■ 10 



<210> 40 1 ' . 

<211> 7 " , ' 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> domain 

<220> i V • 

<221> VARIANT 
<222> (6) 

<223> Xaa - Leu, lie, Val, or Met 
<220> 

<221> VARIANT 
<222> (7) 

<223> Xaa = Thr or lie 
<400> 40 

Asp Lys Thr Gly Thr Xaa Xaa 
1 5 
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